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CaaX Prenyl Protease Nucleic Acids and Polypeptides and Methods of 

Use Thereof 

Field of the invention 

The invention relates to novel plant CaaX prenyl protease polynucleotides and 
5 polypeptides. Also included are transgenic plants expressing the novel polynucleotides 
and polypeptides. Also included are transgenic plant cells, tissues and plants having 
novel phenotypes resulting from the expression of these polynucleotides in either the 
sense or antisense orientation. 



i o Background of the invention 

Most higher plants encounter at least transient decreases in relative water content 
at some stage of their life cycle and, as a result, have evolved a number of desiccation 
protection mechanisms. If however, the change in water deficit is prolonged the effects 
on the plants growth and development can be profound. Decreased water content due to 
15 drought, cold or salt stress can irreparably damage plant cells which in turn limits plant 
growth and crop productivity in agriculture. 

Plants respond to adverse conditions of drought, salinity and cold with a variety 
of morphological and physiological changes. Although our understanding of plant 
tolerance mechanisms to these stresses is incomplete, the plant hormone abscisic acid 
20 (ABA) is believed to be an essential mediator between environmental stimulus and plant 
responses. ABA levels increase in response to water deficits and exogenously applied 
ABA mimics many of the responses induced by water-stress. Once ABA is synthesized it 
causes the closure of the leaf stomata thereby decreasing water loss through 
transpiration. 

25 The identification of genes that transduce ABA into a cellular response opens the 

possibility of exploiting these regulators to enhance desiccation tolerance in crop species. 
In principle, these ABA signaling genes can be coupled with the appropriate controlling 
elements to allow optimal plant growth, development and productivity. Thus, not only 
would these genes allow the genetic tailoring of crops to withstand transitory 

30 environmental stresses, but they should also broaden the environments where traditional 
crops can be grown. 
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The recent isolation of an Arabidopsis mutant, eral, is hypersensitive to ABA 
and has been shown to also be tolerant to conditions of water deprivation. ERA1 has 
been identified as a P subunit of farnesyl transferase knockout mutant in. Farnesyl 
transferase is a heterodimeric enzyme that provides the specific addition of a farnesyl 
5 pyrophosphate moiety onto the substrate target sequence. The target sequence is defined 
as a sequence of four amino acids which are present at the carboxy terminus of the 
protein and is referred to as a CaaX motif in which the "C" is cysteine, "a" is any 
aliphatic amino acid and "X" is any amino acid. The a subunit is common with a second 
prenylation enzyme, geranylgeranyl transferase, that has a different p subunit and adds a 
10 geranylgeranyl isoprenyl pyrophosphate moiety to the target sequence. 

Prenylation is a multistep pathway which includes prenylation of the cysteine 
residue of the CaaX site, cleavage of the -aaX tripeptide and methylation of the prenyl- 
cysteine residue. Potentially, each of these steps could represent a target for genetic 
manipulation of the prenylation process to generate a desired phenotype such as stress 
15 tolerance. 

In plants, prenylation has been linked to cell cycle control, meristem 
development, and phytohormone signal transduction, however, few details of the role of 
prenylation, the substrate proteins or the extent to which the plant system will be 
analogous to the mammalian and yeast systems are known. The most characterized 
20 substrates for CaaX modification are the Ras and a-factor proteins of yeast. Although 

there are three steps to complete protein maturation, abolition or modification of any one 
step does not necessarily result in cessation of target biological activities. Ras function is 
attenuated if the -aaX tripeptide is not cleaved but not abolished and some proteins retain 
the -aaX tripeptide after farnesylation. 

25 In Arabidopsis, more than 600 proteins contain a CaaX motif, suggesting a role 

for the post-translational modification by prenylation in numerous cellular processes. In 
Arabidopsis, it has been demonstrated that the loss-of-function of the p-subunit of 
farnesyl transferase will result in a ABA-hypersensitive phenotype. Although it is still 
not clear why plants lacking the functional p-subunit of farnesyl transferase become 

30 more sensitive to ABA, it clearly suggests that protein prenylation is involved in 
regulation of the homeostasis of ABA sensitivity. The balance of ABA cellular 
responses, whether more sensitive or less sensitive to ABA, is possibly regulated by the 
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relative activities of prenylated proteins. The changes in AtCPP expression and gene 
activity may affect the activity of two pools of genes, one pool acting as positive 
regulators (pool A) and the second pool (pool B) as negative regulators, which require 
prenylation in order to function properly. Pool A may contain genes that can promote 

5 ABA sensitivity, and pool B genes that may reduce ABA sensitivity. The homeostasis of 
ABA sensitivity may therefore governed by the ratio of activity of pool A to pool B. For 
example, in the case of up-regulation of AtCPP in Arabidopsis, the activity ratio of pool 
A over pool B may be increased due to difference in substrate affinity of pool A proteins 
toward AtCPP, thus the homeostasis of ABA sensitivity is changed, and the AtCPP over- 

10 expression plants are more sensitive to ABA. 

This invention is directed at the manipulation of the CaaX prenyl protease 
enzyme (CPP), which catalyses the proteolytic cleavage of the -aaX tripeptide in the 
second step of the prenylation process. Included in this invention aire vector constructs 
containing CPP sequence under the control of appropriate regulatory sequences to 
1 5 produce a water-stress tolerant phenotype. 



Summary of the invention 

The present invention is based in part upon the discovery of novel CaaX prenyl 
protease (CPP) nucleic acid sequences and polypeptides isolated from Arabidopsis 
thaliana, Brassica napus and Glycine max. The nucleic acids, polynucleotides, proteins 

20 and polypeptides, or fragments thereof described herein are collectively referred to as 
CPP nucleic acids and polypeptides. 

Accordingly, in one aspect, the invention provides an isolated nucleic acid 
molecule that includes the sequence of SEQ ID NO:l, SEQ ID NO:14, or SEQ ID NO:17 
or fragment, homolog, analog or derivative thereof. The nucleic acid can include, e.g., a 

25 nucleic acid sequence encoding a polypeptide at least 99% identical to a polypeptide that 
includes the amino acid sequences of SEQ ID NO:2, SEQ ID NO: 15, or SEQ ID NO: 18 
or a nucleic acid sequence encoding a polypeptide at least 96% identical to a polypeptide 
that includes the amino acid sequences of SEQ ID NO: 15. In yet another aspect, the 
invention provides a nucleic acid that includes the sequence of SEQ ID NO: 68, 70, 72 

30 or 74. The nucleic acid can be, e.g., a genomic DNA fragment, or a cDNA molecule. 

Preferably, the nucleic acid is naturally occurring. The invention also provides a nucleic 
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acid sequence that is complementary to the nucleic acid sequence of SEQ ID NO:l, SEQ 
ID NO:14, or SEQ ID NO:17. For example, SEQ ID NO: 16, 19 or 20. 

Also included in the invention is a vector containing one or more of the nucleic 
acids described herein, and a cell containing the vectors or nucleic acids described 
5 herein. In various aspects the vector comrprises the nucleic acid sequences of SEQ ID 
NO: 4, 5, 36-53. 

The invention is also directed to host cells transformed with a vector comprising 
any of the nucleic acid molecules described above. 

The invention is also directed to plants and cells transformed with a CPP nucleic 
10 acid or a vector comprising a CPP nucleic acid. Also included in the invention is the 
seed, and progeny of the transformed plants or cells. 

In a further aspect, the invention includes a substantially purified CPP 
polypeptide, e.g., any of the CPP polypeptides encoded by an CPP nucleic acid, and 
fragments, homologs, analogs, and derivatives thereof. Accordingly, in one aspect, the 
15 invention provides an isolated polypeptide molecule that includes the sequence of SEQ 
ID NO:2, SEQ ID NO: 15, or SEQ ID NO: 18. 

In yet another aspect the invention provides a polypeptides that includes the 
sequence of SEQ ID NO: 69, 71, 73 or 75. 

In still a further aspect, the invention provides an antibody that binds specifically 
20 to an CPP polypeptide. The antibody can be, e.g., a monoclonal or polyclonal antibody, 
and fragments, homologs, analogs, and derivatives thereof. The invention is also directed 
to isolated antibodies that bind to an epitope on a polypeptide encoded by any of the 
nucleic acid molecules described above. 

The invention also includes a method of producing a transgenic plant which has 
25 an altered phenotype such as, but not limited to, increased tolerance to stress, delayed 
senescence, increased ABA sensitivity, increased yield, increased productivity and 
increased biomass compared to a wild type plant by introducing into one or more cells of 
a plant a compound that alters (e.g., increases or decreases) CPP expression or activity in 
the plant. In one aspect the compound is a CPP nucleic acid or polypeptide. In one 
30 emodiment the nucleic acid is an inhibitor or farnesylation. For example, the compound 
comprises SEQ ID NO: 1, 14, 17, 68, 70, 72, 74, 21, 23, 25, 27, 29, 31, 33, 2, 15, 18, 22, 
24 26, 28, 30, 32 34, 69, 71, 73, or 75. Alternatively, the compound is a CPP double 
stranded RNA-inhibition hair-pin nucleic acid or CPP antisense nucleic acid, such as for 
example, SEQ ID NO: 16, 19, 20, 5, 35, 37, 42, 45, 46, 48, 49, 51 or 51. 
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The invention further provides a method for producing a CPP polypeptide by 
providing a cell containing an CPP nucleic acid, e.g., a vector that includes a CPP 
nucleic acid, and culturing the cell under conditions sufficient to express the CPP 
polypeptide encoded by the nucleic acid. The expressed CPP polypeptide is then 
5 recovered from the cell. Preferably, the cell produces little or no endogenous CPP 
polypeptide. The cell can be, e.g., a prokaryotic cell or eukaryotic cell. 

The invention is also directed to methods of identifying a CPP polypeptide or 
nucleic acid in a sample by contacting the sample with a compound that specifically 
binds to the polypeptide or nucleic acid, and detecting complex formation, if present. 

10 The invention further provides methods of identifying a compound that modulates the 
activity of a CPP polypeptide by contacting a CPP polypeptide with a compound and 
determining whether the CPP polypeptide activity is modified. 

The invention is also directed to compounds that modulate CPP polypeptide 
activity identified by contacting a CPP polypeptide with the compound and determining 

15 whether the compound modifies activity of the CPP polypeptide, binds to the CPP 
polypeptide, or binds to a nucleic acid molecule encoding a CPP polypeptide. 

Unless otherwise defined, all technical and scientific terms used herein have the 
same meaning as commonly understood by one of ordinary skill in the art to which this 
invention belongs. Although methods and materials similar or equivalent to those 

20 described herein can be used in the practice or testing of the present invention, suitable 
methods and materials are described below. All publications, patent applications, 
patents, and other references mentioned herein are incorporated by reference in their 
entirety. In the case of conflict, the present specification, including definitions, will 
control. In addition, the materials, methods, and examples are illustrative only and not 

25 intended to be limiting. 

Other features and advantages of the invention will be apparent from the 
following detailed description and claims. 

Brief description of the Drawings 

30 Figure 1. is a schematic representation of the vector constructs; A) pBI121-AtCPP, B) 
pBI121-antisense-AtCPP, C) pBI121-HP-AtCPP. 
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Figure 2. is an illustration of (A) nucleic acid and (B) amino acid sequence identities as 
determined by ClustalW analysis. 

Figure 3. is a scan of a typical Southern blot of transgenic Arabidopsis Tl lines carrying 
the pBI121-AtCPP construct. 

5 Figure 4. is a scan of a typical Southern blot of transgenic Arabidopsis T3 lines carrying 
the pBI 121 -HP- AtCPP construct. 

Figure 5. is a scan of a typical Southern blot of transgenic Arabidopsis lines carrying the 
pRD29A-AtCPP construct. 

Figure 6. is a scan of a typical Southern blot of transgenic Arabidopsis lines carrying the 
1 0 pRD29 A-HP- AtCPP construct. 

Figure 7 is an illustration showing the relative expression of AtCPP mRNA transcript 
(solid bars) and AtCPP protein levels (stippled bars) in several pBIl 21 -AtCPP 
transgenic lines. 

Figure 8. is a histogram showing the percentage of lines which were categorized as 
15 ABA sensitive, moderately ABA sensitive or ABA insensitive. Seedlings were 

assessed on agar plates containing 1 \\M ABA and scored at 21 days growth. 
Thirty-six lines of the pBI121 -AtCPP over-expression construct were assessed at 
21 days by leaf and seedling development. Thirty-two lines of the 35S-HP- 
AtCPP down-regulation construct were assessed at 21 days for leaf and seedling 
20 development. Each line was assessed by plating approximately 100 seeds per 

plate and the seedlings scored and recorded as the percent insensitive seedlings 
per plate. Each line was then expressed as a percent of wild type (Wt). Lines were 
categorized as sensitive (less than 1% of Wt) solid bars, intermediate (1-50% of 
Wt) diagonally lined or insensitive (greater than 50% of Wt) stippled, based on 
25 their relationship to Wt and the percentage of each category plotted as a 

histogram. 

Figure 9. is an illustration showing the response of wild type and a pRD29A-HP- AtCPP 
transgenic line to various concentrations of ABA in two week old seedlings. 

Figure 10. is a histogram showing the analysis of transgenic plants containing the 
30 pBI121-AtCPP over-expression construct, (SEQ ID NO:4). Water loss per gram 

shoot dry weight after four days of water stress treatment. Lines that are marked 
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with a star are those which were strongly ABA sensitive. Lines marked with a 
triangle are moderately ABA sensitive. Bars represent means of eight replicates. 
Lines marked with a filled dot above the bar represents lines which were 
significantly different from control at a p=0.05 value. 

5 Figure 1 1 . is a histogram showing seed yield in grams of transgenic Arabidopsis lines of 
pBI121-AtCPP grown under optimal water conditions 

Figure 12. is a bar chart howing growth and yield of transgenic Arabidopsis lines of 
pBI121-AtCPP grown under optimal watering conditions plus a biotic stress 
condition. Tields as a % of wild type, rosette leaf number, rosette leaf fresh 
10 weight and shoot dry weight are plotted. 

Figure 13. are photographs showing rowth of transgenic Arabidopsis lines of pBI121- 
AtCPP grown on agar plates. Changes to root growth visible. 

Figure 14. is a bar chart showing rowth of transgenic Arabidopsis lines of pRD29A-HP- 
AtCPP grown under optimal watering conditions. Rosette leaf number, rosette 
15 leaf dry weight and shoot dry weight are plotted. 



Detailed description of invention 

The present invention provides novel CaaX prenyl protease (CPP) nucleic acid 
20 sequences (SEQ ID No:l, SEQ ID NO:14 and SEQ ID NO:17) the encoded 

polypeptides: SEQ ID NO:2, SEQ ID NO: 15 and SEQ ID NO: 18) isolated from 
Arabidopsis thaliana (At) Brassica napus (Bn) and Glycine Max (Gm) respectively. The 
invention also provides CaaX prenyl protease antisense nucleic acids. (SEQ ID NO: 16, 
SEQ ID NO: 19 and SEQ ID NO:20). The sequences are collectively referred to as "CPP 
25 nucleic acids", CPP polynucleotides" or "CPP antisense nucleic acids" and the 

corresponding encoded polypeptide is referred to as a "CPP polypeptide" or "CPP 
protein". Unless indicated otherwise, "CPP" is meant to refer to any of the novel 
sequences disclosed herein. Table A below summarizes the nucleic acids and 
polypeptides according to the invention 

30 Table A 



SEQ ID NO. 


SEQ 


Type 




Species 










Transformed 
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1 


AtCPP 


NA 


PCR 




2 


AtCPP 


AA 


Translation 




3 


At- AFC 1 


AA 


Ref. 




4 


dBI 121 -AtCPP 


NA 


Construct 


At, Bn 


5 


dB 1 1 2 1 -HP- AtCPP 


NA 


Construct 


At 


6 


AtCPP BamFW 


NA 


Primer 




7 


AtCPP SmaRV 


NA 


Primer 




g 


AtCPP-HP-SacFW 

XXL A X XXX k-/ C+V/ A. IT 


NA 


Primer 




9 


AtCPP-HP-SacRV 


NA 


Primer 




10 


pBI 121 -AtCPP Forward 


NA 


Primer 




11 


dBI 1 2 1 -anti AtCPP-SmaF W 

VJ 3-J X A X^ A UllLliVlVy/A X C111U1 IT 


NA 


Primer 




12 


dBI 1 2 1 -anti AtCPP-BamRV 


NA 


Primer 




13 


D35S-HP-AtCPP Reverse 


NA 


Primer 




14 


BnCPP 


NA 


PCR 




15 


BnCPP 


AA 


Translation 

A A CllliJlLlLlwll 




16 


BnCPP antisense 

1— J 1 1 \ — " A X lilllUvlUV 


NA 


PCR 




17 


GmCPP 


NA 


PCR 




18 


GmCPP 


AA 


Translation 




19 


GmCPP antisense 


NA 


PCR 




20 


AtCPP antisense 

ill V/A X LA-I 111 Jvll jv 


NA 


PCR 




21 


BASF- ATI 

IJiliJl 1 X X A 


NA 


Ref. 




22 


BASF- ATI 


AA 


Ref. 




23 


BASF-AT2 


NA 


Ref. 




24 


BASF-AT2 


AA 


Ref. 




25 


BASF-Corn 

A—/ J V LJ A WV/1 11 


NA 


Ref. 




26 


BASF-Corn 


AA 


Ref. 




27 


BASF-Sov 


NA 


Ref. 




28 


BASF-Sov 


AA 


Ref. 




29 


AFC1 


NA 


Ref. 




30 


AFC1 


AA 


Ref. 




31 


AT4e01320 


NA 


Ref. 




32 


AT4&01320 


AA 


Ref. 




33 


AF007269 


NA 


Ref. 




34 


AF007269 


AA 


Ref. 




35 


dBI 12 1 -antisense-AtCPP 

UAVl lir 1 Ul A H JvHJv J iLVl X 


NA 


Construct 




36 


nRD29A-AtCPP 


NA 


Construct 


At, Bn 


37 


pRD29A-HP-AtCPP 


NA 


Construct 


At 


38 


pRD29A-antisense-AtCPP 


NA 


Construct 




39 


MuA-AtCPP 

l"AUi X / VI wl X 


NA 


Construct 


Gm, Zm 


40 


MuA-GmCPP 


NA 


Construct 




41 


pBI 121 -GmCPP 




Construct 




42 


pBI 1 2 1 -HP-GmCPP 




Construct 




43 


pBI 121 -antisense-GmCPP 




Construct 




44 


pRD29A-GmCPP 




Construct 




45 


pRD29A-HP-GmCPP 




Construct 




46 


pRD29A-antisense-GmCPP 




Construct 




47 


pBI 121 -BnCPP 




Construct 




48 


pBI121-HP-BnCPP 




Construct 




49 


pBI121-antisense-BnCPP 




Construct 




50 


pRD29A-BnCPP 




Construct 




51 


pRD29A-HP-BnCPP 




Construct 




52 


pRD29A-antisense-BnCPP 




Construct 




53 


MuA-BnCPP 




Construct 




54 


GmCPP SmaFW 




Primer 




55 


GmCPP SacRV 




Primer 




56 


BnCPP-anti-SmaFW 




Primer 




57 


BnCPP-anti-BamRV 




Primer 




58 


BnCPP-HP-Sac-FW 




Primer 





8 



WO 03/012116 



PCT/IB02/03887 



59 


BnCPP-HP-Sac-RV 




Primer 




60 


BnCPP-HP-BamFW 




Primer 




61 


BnCPP-HP-XbaRV 




Primer 




62 


GmCPP-HP-Sac-FW 




Primer 




63 


GmCPP-HP-Sac-RV 




Primer 




64 


GmCPP-HP-BamFW 




Primer 




65 


GmCPP-HP-XbaRV 




Primer 




66 


pRD29AP 




Primer 




67 


Nosterm-RV 




Primer 




68 


Consensus- BASF 


NA 






69 


Consensus- BASF 


AA 






70 


Consensus- Generic 


NA 






71 


Consensus- Generic 


AA 






72 


Consensus- PPI 


NA 






73 


Consensus- PPI 


AA 






74 


Consensus- PPI/Generic 


NA 






75 


Consensus- PPI/Genreric 


AA 







In a BLAST search of public sequence databases, it was found, for example, 
that the Arabidopsis thaliana nucleic acid sequence has 99.5 % identity to an 
Arabidopsis thaliana CaaX processing zinc-metallo endoprotease (AFC1) mRNA 
5 (Genbank Accesion No.: AF353722). The full amino acid sequence of the protein of the 
invention was found to be 98.8 % identical to Arabidopsis thaliana CaaX processing 
zinc-metallo endoprotease (AFC1) polypeptide (Genbank Accesion No.:AAK39514). A 
ClustalW alignment of the Arabidopsis thaliana CPP polypeptide (SEQ ID NO:2), the 
Brassica napus CPP polypeptide (SEQ ID NO: 15), the Glycine max CPP polypeptide 

10 (SEQ ID NO: 18) and seven other published CPP sequences is illustrated in Table 6B. 
ClustalW alignment of these polypeptides indicate that SEQ ID NO:2, SEQ ID NO: 15 
and SEQ ID NO: 18 are 99%, 93% and 83% identical to the published AFC sequence 
(SEQ ID NO:30) respectively. The Glycine max CPP polypeptide (SEQ ID NO: 18) is 
99% identical to the published sequence shown as SEQ ID NO:28. Similarly, ClustalW 

15 alignment of the Arabidopsis thaliana CPP polynucleotide (SEQ ID NO:l), the Brassica 
napus CPP polynucleotide (SEQ ID NO: 14), the Glycine max CPP polynucleotide (SEQ 
ID NO: 17) and seven other published CPP sequences is illustrated in Table 6a indicate 
that SEQ ID NO:l, SEQ ID NO:14 and SEQ ID NO:17 are 99%, 93% and 77% identical 
to the published AFC sequence (SEQ ID NO:30) respectively. The Glycine max CPP 

20 polynucleotide (SEQ ID NO: 17) is 93% identical to the published sequence shown as 
SEQ ID NO:27. 

CaaX prenyl proteases belong to a family of putative membrane-bound proteins 

that are involved in protein and/or peptide modification (i.e., prenylation) and secretion. 

Prenylation is a post translational modification of specific proteins and is required for the 

9 
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proper localization of these polypeptides to the correct cellular site for functionality. 
Prenylation is a three step process involving the addition of either a CI 5 farnesyl, or C20 
geranylgeranyl group to the cysteine residue of the target 3 f terminal CaaX sequence, 
where "C" is a cysteine, "a" is any aliphatic amino acid and "X" is any amino acid. 
5 Secondly, a CaaX prenyl protease (CPP) cleaves the -aaX tripeptide from the protein and 
thirdly the exposed ot-carboxyl group of the cysteine is methylated by a prenyl-cysteine 
methyltransferase. 

Protein farnesylation, the addition of a C-terminal, 1 5 carbon chain to protein and 
subsequent processing is a three step enzymatic reaction including farnesylation, 

10 proteolytic cleavage and methylation. First, a farnesyltransferase adds the C-terminal 15 
carbon chain to a conserved cysteine residue of the CaaX terminal motif, where "C" is a 
Cystine, "a" is an aliphatic amino acid and "X" is any amino acid. Second, the last three 
amino acid residues (aaX) are cleaved by a prenyl protease. Lastly, the 
modified cysteine is methylated by a methylase to create the final active product of the 

1 5 protein farnesylation pathway. The Applicants have shown previously that over 

expression and down-regulation of the alpha or the beta farnesyl transferase gene in plant 
cells ( i.e, the first step in farnesylation) results in plants with an altered phenotype such 
as but not limited to drought tolerance and delayed senescence. The present invention 
shows that over expression and down-regulation of the prenyl protease gene (i.e, the 

20 second step in farnesylation) in plant cells also results in a plant displaying an altered 
phenotype including for example but not limited to drought tolerance and increased 
resistance to biotic and abiotic stress. These results taken together support the 
hypothesis that modification of the expression of any of the enzymes in the 
farnnesylation pathway in a plant cell will result in a plant displaying an altered 

25 phenotype 

Based on their structural and functional relatedness to known CaaX prenyl 
protease proteins, the CPP proteins are novel members of the CaaX prenyl protease 
family of proteins. CPP nucleic acids, and their encoded polypeptides, according to the 
invention are useful in a variety of applications and contexts. For example, the nucleic 
30 acids (i.e., sense or antisense CPP nucleic acids) can be used produce transgenic plants 
that have an increase resistance to biotic and abiotic stresses, e.g., chilling stress, salt 
stress, water stress, wound healing, pathogen challenge, or herbicides. Additionally, the 
transgenic plants have an increased productivity during both optimal and suboptimal 
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growth conditions, increased yield, or increased biomass. Alternatively, the transgenic 
plants have an increased sensitivity to the phytohormone abscisic acid (ABA). 

This invention includes methods to up-regulate the CPP enzyme activity in 
transgenic plants, cells and tissue cultures by using an over-expression vector construct 
5 and methods to down-regulate the CPP enzyme activity in transgenic plants, cells and 
tissue cultures by using a double stranded RNA-inhibition, hairpin vector constructs or 
antisense constructs. Alteration (i.e., upregulation or downregulation) of CPP enzyme 
activity or expression results in transgenic plants with altered phenotypes as described 
below. These methods are by way of example to produce the up-regulation or down- 
10 regulation effects and are not meant to be limiting as to the method of achieving this 
outcome. 

Additionally, the nucleic acids and polypeptides according to the invention may 
be used as targets for the identification of small molecules that modulate or inhibit, CPP 
activity. Alternatively, the CPP nucleic acids and polypeptides can be used to identify 
15 proteins that are members of the CaaX prenyl protease family of proteins. 

Additional utilities for CPP nucleic acids and polypeptides according to the 
invention are disclosed herein. 

CPP Nucleic Acids 

The nucleic acids of the invention include those that encode a CPP polypeptide or 
20 protein. As used herein, the terms polypeptide and protein are interchangeable. 

In some embodiments, a CPP nucleic acid encodes a mature CPP polypeptide. As 
used herein, a "mature" form of a polypeptide or protein described herein relates to the 
product of a naturally occurring polypeptide or precursor form or proprotein. The 
naturally occurring polypeptide, precursor or proprotein includes, by way of nonlimiting 
25 example, the full length gene product, encoded by the corresponding gene. 

Alternatively, it may be defined as the polypeptide, precursor or proprotein encoded by 
an open reading frame described herein. The product "mature" form arises, again by 
way of nonlimiting example, as a result of one or more naturally occurring processing 
steps that may take place within the cell in which the gene product arises. Examples of 
30 such processing steps leading to a "mature" form of a polypeptide or protein include the 
cleavage of the N-terminal methionine residue encoded by the initiation codon of an 
open reading frame, or the proteolytic cleavage of a signal peptide or leader sequence. 
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Thus a mature form arising from a precursor polypeptide or protein that has residues 1 to 
N, where residue 1 is the N-terminal methionine, would have residues 2 through N 
remaining after removal of the N-terminal methionine. Alternatively, a mature form 
arising from a precursor polypeptide or protein having residues 1 to N, in which an N- 
5 terminal signal sequence from residue 1 to residue M is cleaved, would have the residues 
from residue M+l to residue N remaining. Further as used herein, a "mature" form of a 
polypeptide or protein may arise from a step of post-translational modification other than 
a proteolytic cleavage event. Such additional processes include, by way of non-limiting 
example, glycosylation, myristoylation or phosphorylation. In general, a mature 

10 polypeptide or protein may result from the operation of only one of these processes, or a 
combination of any of them. 

Among the CPP nucleic acids is the nucleic acid whose sequence is provided in 
SEQ ID NO: 1, SEQ ID NO: 14 OR SEQ ID NO: 17 or a fragment thereof. Additionally, 
the invention includes mutant or variant nucleic acids of SEQ ID NO: 1, SEQ ID NO: 14 

15 OR SEQ ID NO: 17 or a fragment thereof, any of whose bases may be changed from the 
corresponding base shown in SEQ ID NO: 1, SEQ ID NO:14 or SEQ ID NO:17, while 
still encoding a protein that maintains at least one of its CPP-like activities and 
physiological functions. The invention further includes the complement of the nucleic 
acid sequence of SEQ ID NO: 1, SEQ ID NO:14 or SEQ ID NO:17, including 

20 fragments, derivatives, analogs and homologs thereof. Complement nucleic acid CPP 
sequences include SEQ ID NO: 16, 19 or 20. The invention additionally includes 
nucleic acids or nucleic acid fragments, or complements thereto, whose structures 
include chemical modifications. 

One aspect of the invention pertains to isolated nucleic acid molecules that 

25 encode CPP proteins or biologically active portions thereof. Also included are nucleic 
acid fragments sufficient for use as hybridization probes to identify CPP-encoding 
nucleic acids (e.g., CPP mRNA) and fragments for use as polymerase chain reaction 
(PCR) primers for the amplification or mutation of CPP nucleic acid molecules. As used 
herein, the term "nucleic acid molecule" is intended to include DNA molecules (e.g., 

30 cDNA or genomic DNA), RNA molecules (e.g., mRNA), analogs of the DNA or RNA 
generated using nucleotide analogs, and derivatives, fragments and homologs thereof. 
The nucleic acid molecule can be single-stranded or double-stranded, but preferably is 
double-stranded DNA. 
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"Probes" refer to nucleic acid sequences of variable length, preferably between at 
least about 10 nucleotides (nt), 100 nt, or as many as about, e.g., 6,000 nt, depending on 
use. Probes are used in the detection of identical, similar, or complementary nucleic acid 
sequences. Longer length probes are usually obtained from a natural or recombinant 
5 source, are highly specific and much slower to hybridize than oligomers. Probes may be 
single- or double-stranded and designed to have specificity in PCR, membrane-based 
hybridization technologies, or ELISA-like technologies. 

An "isolated" nucleic acid molecule is one that is separated from other nucleic 
acid molecules that are present in the natural source of the nucleic acid. Examples of 

10 isolated nucleic acid molecules include, but are not limited to, recombinant DNA 
molecules contained in a vector, recombinant DNA molecules maintained in a 
heterologous host cell, partially or substantially purified nucleic acid molecules, and 
synthetic DNA or RNA molecules. Preferably, an "isolated" nucleic acid is free of 
sequences which naturally flank the nucleic acid (i.e., sequences located at the 5' and 3* 

15 ends of the nucleic acid) in the genomic DNA of the organism from which the nucleic 
acid is derived. For example, in various embodiments, the isolated CPP nucleic acid 
molecule can contain less than about 50 kb, 25 kb, 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb or 
0.1 kb of nucleotide sequences which naturally flank the nucleic acid molecule in 
genomic DNA of the cell from which the nucleic acid is derived. Moreover, an 

20 "isolated" nucleic acid molecule, such as a cDNA molecule, can be substantially free of 
other cellular material or culture medium when produced by recombinant techniques, or 
of chemical precursors or other chemicals when chemically synthesized. 

A nucleic acid molecule of the present invention, e.g., a nucleic acid molecule 
having the nucleotide sequence of SEQ ID NO: 1, SEQ ID NO: 14, or SEQ ID NO:17or a 

25 complement of any of this nucleotide sequence, can be isolated using standard molecular 
biology techniques and the sequence information provided herein. Using all or a portion 
of the nucleic acid sequence of SEQ ID NO: 1, SEQ ID NO:14 or SEQ ID NO:17 as a 
hybridization probe, CPP nucleic acid sequences can be isolated using standard 
hybridization and cloning techniques (e.g., as described in Sambrook et al, eds., 

30 Molecular Cloning: A Laboratory Manual 2 nd Ed., Cold Spring Harbor 

Laboratory Press, Cold Spring Harbor, NY, 1989; and Ausubel, et ah, eds., Current 
Protocols in Molecular Biology, John Wiley & Sons, New York, NY, 1993.) 

A nucleic acid of the invention can be amplified using cDNA, mRNA or 
alternatively, genomic DNA, as a template and appropriate oligonucleotide primers 
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according to standard PCR amplification techniques. The nucleic acid so amplified can 
be cloned into an appropriate vector and characterized by DNA sequence analysis. 
Furthermore, oligonucleotides corresponding to CPP nucleotide sequences can be 
prepared by standard synthetic techniques, e.g., using an automated DNA synthesizer. 
5 As used herein, the term "oligonucleotide" refers to a series of linked nucleotide 

residues, which oligonucleotide has a sufficient number of nucleotide bases to be used in 
a PCR reaction. A short oligonucleotide sequence may be based on, or designed from, a 
genomic or cDNA sequence and is used to amplify, confirm, or reveal the presence of an 
identical, similar or complementary DNA or RNA in a particular cell or tissue. 

10 Oligonucleotides comprise portions of a nucleic acid sequence having about 10 nt, 50 nt, 
or 100 nt in length, preferably about 15 nt to 30 nt in length. In one embodiment, an 
oligonucleotide comprising a nucleic acid molecule less than 100 nt in length would 
further comprise at lease 6 contiguous nucleotides of SEQ ID NO: 1, 14 or 17, or a 
complement thereof. Oligonucleotides may be chemically synthesized and may be used 

15 as probes. 

In another embodiment, an isolated nucleic acid molecule of the invention 
includes a nucleic acid molecule that is a complement of the nucleotide sequence shown 
in SEQ ID NO: 1, SEQ ID NO:14 or SEQ ID NO:17. In another embodiment, an 
isolated nucleic acid molecule of the invention comprises a nucleic acid molecule that is 

20 a complement of the nucleotide sequence shown in SEQ ID NO: 1, SEQ ID NO: 14 or 
SEQ ID NO: 17, or a portion of these nucleotide sequence. A nucleic acid molecule that 
is complementary to the nucleotide sequence shown in SEQ ID NO: 1, SEQ ID NO: 14 or 
SEQ ID NO: 17 is one that is sufficiently complementary to the nucleotide sequence 
shown in SEQ ID NO: 1, SEQ ID NO:14 or SEQ ID NO:17 that it can hydrogen bond 

25 with little or no mismatches to the nucleotide sequence shown in SEQ ID NO: 1, SEQ ID 
NO: 14 or SEQ ID NO: 17, thereby forming a stable duplex. Exemplary complement 
nucleic acid sequences include the sequences of SEQ ID NO: 16, 19 or 20. 

As used herein, the term "complementary" refers to Watson-Crick or Hoogsteen 
base pairing between nucleotide units of a nucleic acid molecule, and the term "binding" 

30 means the physical or chemical interaction between two polypeptides or compounds or 
associated polypeptides or compounds or combinations thereof. Binding includes ionic, 
non-ionic, Von der Waals, hydrophobic interactions, etc. A physical interaction can be 
either direct or indirect. Indirect interactions may be through or due to the effects of 
another polypeptide or compound. Direct binding refers to interactions that do not take 
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place through, or due to, the effect of another polypeptide or compound, but instead are 
without other substantial chemical intermediates. 

Moreover, the nucleic acid molecule of the invention can comprise only a portion 
of the nucleic acid sequence of SEQ ID NO: 1, SEQ ID NO: 14 or SEQ ID NO: 17, e.g., 
5 a fragment that can be used as a probe or primer, or a fragment encoding a biologically 
active portion of CPP. Fragments provided herein are defined as sequences of at least 6 
(contiguous) nucleic acids or at least 4 (contiguous) amino acids, a length sufficient to 
allow for specific hybridization in the case of nucleic acids or for specific recognition of 
an epitope in the case of amino acids, respectively, and are at most some portion less 

10 than a full length sequence. Fragments may be derived from any contiguous portion of a 
nucleic acid or amino acid sequence of choice. Derivatives are nucleic acid sequences or 
amino acid sequences formed from the native compounds either directly or by 
modification or partial substitution. Analogs are nucleic acid sequences or amino acid 
sequences that have a structure similar to, but not identical to, the native compound but 

15 differs from it in respect to certain components or side chains. Analogs may be synthetic 
or from a different evolutionary origin and may have a similar or opposite metabolic 
activity compared to wild type. 

Derivatives and analogs may be full length or other than full length, if the 
derivative or analog contains a modified nucleic acid or amino acid, as described below. 

20 Derivatives or analogs of the nucleic acids or proteins of the invention include, but are 
not limited to, molecules comprising regions that are substantially homologous to the 
nucleic acids or proteins of the invention, in various embodiments, by at least about 70%, 
80%, 85%, 90%, 95%, 98%, or even 99% identity (with a preferred identity of 80-99%) 
over a nucleic acid or amino acid sequence of identical size or when compared to an 

25 aligned sequence in which the alignment is done by a computer homology program 
known in the art, or whose encoding nucleic acid is capable of hybridizing to the 
complement of a sequence encoding the aforementioned proteins under stringent, 
moderately stringent, or low stringent conditions. See e.g. Ausubel, et al. 9 Current 
Protocols in Molecular Biology, John Wiley & Sons, New York, NY, 1993, and 

30 below. An exemplary program is the Gap program (Wisconsin Sequence Analysis 
Package, Version 8 for UNIX, Genetics Computer Group, University Research Park, 
Madison, WI) using the default settings, which uses the algorithm of Smith and 
Waterman (Adv. Appl. Math., 1981, 2: 482-489, which is incorporated herein by 
reference in its entirety). A "homologous nucleic acid sequence" or "homologous amino 
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acid sequence," or variations thereof, refer to sequences characterized by a homology at 
the nucleotide level or amino acid level as discussed above. Homologous nucleotide 
sequences encode those sequences coding for isoforms of a CPP polypeptide. Isoforms 
can be expressed in different tissues of the same organism as a result of, for example, 
5 alternative splicing of RNA. Alternatively, isoforms can be encoded by different genes. 
Homologous nucleotide sequences also include, but are not limited to, naturally 
occurring allelic variations and mutations of the nucleotide sequences set forth herein. 
Exemplary homologous nucleic acid sequences include the nucleic acid sequences of 
SEQ ID NO: 68, 70, 72 and 74. Homologous nucleic acid sequences include those 

10 nucleic acid sequences that encode conservative amino acid substitutions (see below) in 
SEQ ID NO:2, SEQ ID NO:15 and SEQ ID NO:18 , as well as a polypeptide having CPP 
activity, e.g. substrate binding. 

The nucleotide sequence determined from the cloning of the Arabidopsis 
thaliana, Brassica napus or Glycine max CPP gene allows for the generation of probes 

15 and primers designed for use in identifying and/or cloning CPP homologues in other cell 
types, e.g., from other tissues, as well as CPP homologues from other plants. The 
probe/primer typically comprises a substantially purified oligonucleotide. The 
oligonucleotide typically comprises a region of nucleotide sequence that hybridizes 
under stringent conditions to at least about 12, 25, 50, 100, 150, 200, 250, 300, 350 or 

20 400 or more consecutive sense strand nucleotide sequence of SEQ ID NO: 1, SEQ ID 
NO: 14 or SEQ ID NO: 17; or an anti-sense strand nucleotide sequence of SEQ ID NO: 1, 
SEQ ID NO: 14 or SEQ ID NO: 17; or of a naturally occurring mutant of SEQ ID NO: 1, 
SEQ ID NO: 14 or SEQ ID NO: 17. 

Probes based on the Arabidopsis thaliana, Brassica napus or Glycine max CPP 

25 nucleotide sequence can be used to detect transcripts or genomic sequences encoding the 
same or homologous proteins. In various embodiments, the probe further comprises a 
label group attached thereto, e.g., the label group can be a radioisotope, a fluorescent 
compound, an enzyme, or an enzyme co-factor. Such probes can be used as a part of a 
diagnostic test kit for identifying cells or tissue which misexpress a CPP protein, such as 

30 by measuring a level of a CPP-encoding nucleic acid in a sample of cells from a subject 
e.g., detecting CPP mRNA levels or determining whether a genomic CPP gene has been 
mutated or deleted. 

A "polypeptide having a biologically active portion of CPP" refers to 
polypeptides exhibiting activity similar, but not necessarily identical to, an activity of a 
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polypeptide of the present invention, including mature forms, as measured in a particular 
biological assay, with or without dose dependency. A nucleic acid fragment encoding a 
"biologically active portion of CPP" can be prepared by isolating a portion of SEQ ID 
NO: 1, SEQ ID NO:14 or SEQ ID NO:17 that encodes a polypeptide having a CPP 
5 biological activity (biological activities of the CPP proteins are described below), 

expressing the encoded portion of CPP protein (e.g., by recombinant expression in vitro) 
and assessing the activity of the encoded portion of CPP. In another embodiment, a 
nucleic acid fragment encoding a biologically active portion of CPP includes one or 
more regions. 

10 

CPP Variants 

The invention further encompasses nucleic acid molecules that differ from the 
nucleotide sequences shown in SEQ ID NO: 1, SEQ ID NO:14 or SEQ ID NO:17 due to 
the degeneracy of the genetic code. These nucleic acids thus encode the same CPP 

15 protein as that encoded by the nucleotide sequence shown in SEQ ID NO: 1, SEQ ID 
NO: 14 or SEQ ID NO: 17, e.g., the polypeptide of SEQ ID NO: 2, SEQ ID NO: 15, SEQ 
ID NO: 18. In another embodiment, an isolated nucleic acid molecule of the invention 
has a nucleotide sequence encoding a protein having an amino acid sequence shown in 
SEQ ID NO: 2, SEQ ID NO: 1 5, SEQ ID NO: 1 8. 

20 In addition to the Arabidopsis thaliana, Brassica napus or Glycine max CPP 

nucleotide sequence shown in SEQ ID NO: 1, SEQ ID NO:14 or SEQ ID NO:17, it will 
be appreciated by those skilled in the art that DNA sequence polymorphisms that lead to 
changes in the amino acid sequences of CPP may exist within a population (e.g., the 
plant). Such genetic polymorphism in the CPP gene may exist among individuals within 

25 a population due to natural allelic variation. As used herein, the terms "gene" and 

"recombinant gene" refer to nucleic acid molecules comprising an open reading frame 
encoding a CPP protein, preferably a plant CPP protein. Such natural allelic variations 
can typically result in 1-5% variance in the nucleotide sequence of the CPP gene. Any 
and all such nucleotide variations and resulting amino acid polymorphisms in CPP that 

30 are the result of natural allelic variation and that do not alter the functional activity of 

CPP are intended to be within the scope of the invention. 

Moreover, nucleic acid molecules encoding CPP proteins from other species, and 

thus that have a nucleotide sequence that differs from the sequence of SEQ ID NO: 1, 

SEQ ID NO:14 or SEQ ID NO: 17 are intended to be within the scope of the invention. 

17 
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Nucleic acid molecules corresponding to natural allelic variants and homologues of the 
CPP cDNAs of the invention can be isolated based on their homology to the Arabidopsis 
thaliana, Brassica napus or Glycine max CPP nucleic acids disclosed herein using the 
cDNAs, or a portion thereof, as a hybridization probe according to standard 
5 hybridization techniques under stringent hybridization conditions. 

Accordingly, in another embodiment, an isolated nucleic acid molecule of the 
invention is at least 6 nucleotides in length and hybridizes under stringent conditions to 
the nucleic acid molecule comprising the nucleotide sequence of SEQ ID NO: 1, SEQ 
ID NO: 14 or SEQ ID NO: 17. In another embodiment, the nucleic acid is at least 10, 25, 

10 50, 100, 250, 500 or 750 nucleotides in length. In another embodiment, an isolated 

nucleic acid molecule of the invention hybridizes to the coding region. As used herein, 
the term "hybridizes under stringent conditions" is intended to describe conditions for 
hybridization and washing under which nucleotide sequences at least 60% homologous 
to each other typically remain hybridized to each other. 

15 Homologs (i.e., nucleic acids encoding CPP proteins derived from species other 

than Arabidopsis thaliana, Brassica napus or Glycine max) or other related sequences 
(e.g., paralogs) can be obtained by low, moderate or high stringency hybridization with 
all or a portion of the particular sequence as a probe using methods well known in the 
art for nucleic acid hybridization and cloning. 

20 As used herein, the phrase "stringent hybridization conditions" refers to 

conditions under which a probe, primer or oligonucleotide will hybridize to its target 
sequence, but to no other sequences. Stringent conditions are sequence-dependent and 
will be different depending upon circumstances. Longer sequences hybridize 
specifically at higher temperatures than shorter sequences. Generally, stringent 

25 conditions are selected to be about 5°C lower than the thermal melting point (T m ) for the 
specific sequence at a defined ionic strength and pH. The Tm is the temperature (under 
defined ionic strength, pH and nucleic acid concentration) at which 50% of the probes 
complementary to the target sequence hybridize to the target sequence at equilibrium. 
Since the target sequences are generally present at excess, at Tm, 50% of the probes are 

30 occupied at equilibrium. Typically, stringent conditions will be those in which the salt 
concentration is less than about 1.0 M sodium ion, typically about 0.01 to 1.0 M sodium 
ion (or other salts) at pH 7.0 to 8.3 and the temperature is at least about 30°C for short 
probes, primers or oligonucleotides (e.g., 10 nt to 50 nt) and at least about 60°C for 
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longer probes, primers and oligonucleotides. Stringent conditions may also be achieved 
with the addition of destabilizing agents, such as formamide. 

Stringent conditions are known to those skilled in the art and can be found in 
Current Protocols in Molecular Biology, John Wiley & Sons, N.Y. (1989), 
5 6.3.1-6.3.6. Preferably, the conditions are such that sequences at least about 65%, 70%, 
75%, 85%, 90%, 95%, 98%, or 99% homologous to each other typically remain 
hybridized to each other. A non-limiting example of stringent hybridization conditions is 
hybridization in a high salt buffer comprising 6X SSC, 50 mM Tris-HCl (pH 7.5), 1 mM 
EDTA, 0.02% PVP, 0.02% Ficoll, 0.02% BSA, and 500 mg/ml denatured salmon sperm 

10 DNA at 65°C. This hybridization is followed by one or more washes in 0.2X SSC, 

0.01% BSA at 50°C. An isolated nucleic acid molecule of the invention that hybridizes 
under stringent conditions to the sequence of SEQ ID NO: 1, SEQ ID NO: 14 or SEQ ID 
NO: 17 corresponds to a naturally occurring nucleic acid molecule. As used herein, a 
"naturally-occurring" nucleic acid molecule refers to an RNA or DNA molecule having a 

15 nucleotide sequence that occurs in nature (e.g., encodes a natural protein). 

In a second embodiment, a nucleic acid sequence that is hybridizable to the 
nucleic acid molecule comprising the nucleotide sequence of SEQ ID NO: 1, SEQ ID 
NO: 14 or SEQ ID NO: 17, or fragments, analogs or derivatives thereof, under conditions 
of moderate stringency is provided. A non-limiting example of moderate stringency 

20 hybridization conditions are hybridization in 6X SSC, 5X Denhardt's solution, 0.5% SDS 
and 100 mg/ml denatured salmon sperm DNA at 55°C, followed by one or more washes 
in IX SSC, 0.1% SDS at 37°C. Other conditions of moderate stringency that may be 
used are well known in the art. See, e.g., Ausubel et ah feds.), 1993, Current 
Protocols in Molecular Biology, John Wiley & Sons, NY, and Kriegler, 1 990, 

25 Gene Transfer and Expression, A Laboratory Manual, Stockton Press, NY. 

In a third embodiment, a nucleic acid that is hybridizable to the nucleic acid 
molecule comprising the nucleotide sequence of SEQ ID NO: 1 or SEQ ID NO: 14, or 
SEQ ID NO: 17 or fragments, analogs or derivatives thereof, under conditions of low 
stringency, is provided. A non-limiting example of low stringency hybridization 

30 conditions are hybridization in 35% formamide, 5X SSC, 50 mM Tris-HCl (pH 7.5), 5 
mM EDTA, 0.02% PVP, 0.02% Ficoll, 0.2% BSA, 100 mg/ml denatured salmon sperm 
DNA, 10% (wt/vol) dextran sulfate at 40°C, followed by one or more washes in 2X SSC, 
25 mM Tris-HCl (pH 7.4), 5 mM EDTA, and 0.1% SDS at 50°C. Other conditions of 
low stringency that may be used are well known in the art (e.g., as employed for 
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cross-species hybridizations). See, e.g., Ausubel et al feds.), 1993, Current 
Protocols in Molecular Biology, John Wiley & Sons, NY, and Kriegler, 1990, 
Gene Transfer and Expression, A Laboratory Manual, Stockton Press, NY; Shilo 
and Weinberg, 1981, Proc Natl Acad Sci USA 78: 6789-6792. 

5 

Conservative mutations 

In addition to naturally-occurring allelic variants of the CPP sequence that may 
exist in the population, the skilled artisan will further appreciate that changes can be 
introduced by mutation into the nucleotide sequence of SEQ ID NO: 1, SEQ ID NO: 14 

10 or SEQ ID NO: 17, thereby leading to changes in the amino acid sequence of the 

encoded CPP protein, without altering the functional ability of the CPP protein. For 
example, nucleotide substitutions leading to amino acid substitutions at "non-essential" 
amino acid residues can be made in the sequence of SEQ ID NO: 1, SEQ ID NO: 14 or 
SEQ ID NO: 17. A "non-essential" amino acid residue is a residue that can be altered 

15 from the wild-type sequence of CPP without altering the biological activity, whereas an 
"essential" amino acid residue is required for biological activity. For example, amino 
acid residues that are conserved among the CPP proteins of the present invention, are 
predicted to be particularly unamenable to alteration. 

Another aspect of the invention pertains to nucleic acid molecules encoding CPP 

20 proteins that contain changes in amino acid residues that are not essential for activity. 
Such CPP proteins differ in amino acid sequence from SEQ ID NO: 2, SEQ ID NO: 1 5 
or SEQ ID NO: 18, yet retain biological activity. In one embodiment, the isolated nucleic 
acid molecule comprises a nucleotide sequence encoding a protein, wherein the protein 
comprises an amino acid sequence at least about 75% homologous to the amino acid 

25 sequence of SEQ ID NO: 2, SEQ ID NO: 1 5 or SEQ ID NO: 1 8. Preferably, the protein 
encoded by the nucleic acid is at least about 80% homologous to S SEQ ID NO: 2, SEQ 
ID NO: 15 or SEQ ID NO: 18 more preferably at least about 90%, 95%, 98%, and most 
preferably at least about 99% homologous to SEQ ID NO: 2, SEQ ID NO: 15 or SEQ ID 
NO:18. 

30 An isolated nucleic acid molecule encoding a CPP protein homologous to the 

protein of SEQ ID NO: 2, SEQ ID NO: 15 or SEQ ID NO: 18 can be created by 
introducing one or more nucleotide substitutions, additions or deletions into the 
nucleotide sequence of SEQ ID NO: 1 or SEQ ID NO: 14, or SEQ ID NO: 17 such that 
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one or more amino acid substitutions, additions or deletions are introduced into the 
encoded protein. 

Mutations can be introduced into the nucleotide sequence of SEQ ID NO: 1, SEQ 
ID NO: 14 or SEQ ID NO: 17 by standard techniques, such as site-directed mutagenesis 
5 and PCR-mediated mutagenesis. Preferably, conservative amino acid substitutions are 
made at one or more predicted non-essential amino acid residues. A "conservative 
amino acid substitution" is one in which the amino acid residue is replaced with an 
amino acid residue having a similar side chain. Families of amino acid residues having 
similar side chains have been defined in the art. These families include amino acids with 

10 basic side chains (e.g., lysine, arginine, histidine), acidic side chains (e.g., aspartic acid, 
glutamic acid), uncharged polar side chains (e.g., glycine, asparagine, glutamine, serine, 
threonine, tyrosine, cysteine), nonpolar side chains (e.g. , alanine, valine, leucine, 
isoleucine, proline, phenylalanine, methionine, tryptophan), beta-branched side chains 
(e.g., threonine, valine, isoleucine) and aromatic side chains (e.g., tyrosine, 

15 phenylalanine, tryptophan, histidine). Thus, a predicted nonessential amino acid residue 
in CPP is replaced with another amino acid residue from the same side chain family. 
Alternatively, in another embodiment, mutations can be introduced randomly along all or 
part of a CPP coding sequence, such as by saturation mutagenesis, and the resultant 
mutants can be screened for CPP biological activity to identify mutants that retain 

20 activity. Following mutagenesis of SEQ ID NO: 1, SEQ ID NO:14 or SEQ ID NO:17 
the encoded protein can be expressed by any recombinant technology known in the art 
and the activity of the protein can be determined. 

In one embodiment, a mutant CPP protein can be assayed for (1) the ability to 
form proteimprotein interactions with other CPP proteins, other cell-surface proteins, or 

25 biologically active portions thereof, (2) complex formation between a mutant CPP 
protein and a CPP receptor; (3) the ability of a mutant CPP protein to bind to an 
intracellular target protein or biologically active portion thereof; (e.g., avidin proteins); 
(4) the ability to bind CPP protein; or (5) the ability to specifically bind an anti-CPP 
protein antibody. 

30 

Antisense CPP Nucleic Acids 

Another aspect of the invention pertains to isolated antisense nucleic acid 

molecules that are hybridizable to or complementary to the nucleic acid molecule 

comprising the nucleotide sequence of SEQ ID NO: 1 or SEQ ID NO: 14 or SEQ ID 

21 
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NO: 17 , or fragments, analogs or derivatives thereof. An "antisense" nucleic acid 
comprises a nucleotide sequence that is complementary to a "sense" nucleic acid 
encoding a protein, e.g., complementary to the coding strand of a double-stranded cDNA 
molecule or complementary to an mRNA sequence. In specific aspects, antisense 
5 nucleic acid molecules are provided that comprise a sequence complementary to at least 
about 10, 25, 50, 100, 250 or 500 nucleotides or an entire CPP coding strand, or to only a 
portion thereof. Nucleic acid molecules encoding fragments, homologs, derivatives and 
analogs of a CPP protein of SEQ ID NO: 2 or SEQ ID NO: 15 or SEQ ID NO: 18 or 
antisense nucleic acids complementary to a CPP nucleic acid sequence of SEQ ID NO: 1, 

10 SEQ ID NO: 14 or SEQ ID NO: 1 7 are additionally provided. Exemplary CPP anti-sense 
nucleic acid include the nucleic acid sequences of SEQ ID NO: 16, 19, and 20. 

In one embodiment, an antisense nucleic acid molecule is antisense to a "coding 
region" of the coding strand of a nucleotide sequence encoding CPP. The term "coding 
region" refers to the region of the nucleotide sequence comprising codons which are 

15 translated into amino acid residues {e.g., the protein coding region of Arabidopsis 

thaliana, Brassica napus or Glycine max CPP corresponds to SEQ ID NO: 2 or SEQ ID 
NO: 15 or SEQ ID NO: 18). In another embodiment, the antisense nucleic acid molecule 
is antisense to a "noricoding region" of the coding strand of a nucleotide sequence 
encoding CPP. The term "noncoding region" refers to 5 f and 3' sequences which flank 

20 the coding region that are not translated into amino acids {i.e., also referred to as 5' and 3' 
untranslated regions). 

Given the coding strand sequences encoding CPP disclosed herein {e.g., SEQ ID 
NO: 1 or SEQ ID NO: 14 or SEQ ID NO: 17), antisense nucleic acids of the invention can 
be designed according to the rules of Watson and Crick or Hoogsteen base pairing. The 

25 antisense nucleic acid molecule can be complementary to the entire coding region of 

CPP mRNA, but more preferably is an oligonucleotide that is antisense to only a portion 
of the coding or noncoding region of CPP mRNA. For example, the antisense 
oligonucleotide can be complementary to the region surrounding the translation start site 
of CPP mRNA. An antisense oligonucleotide can be, for example, about 5, 10, 15, 20, 

30 25, 30, 35, 40, 45 or 50 nucleotides in length. An antisense nucleic acid of the invention 
can be constructed using chemical synthesis or enzymatic ligation reactions using 
procedures known in the art. For example, an antisense nucleic acid {e.g., an antisense 
oligonucleotide) can be chemically synthesized using naturally occurring nucleotides or 
variously modified nucleotides designed to increase the biological stability of the 



WO 03/012116 PCT/IB02/03887 

molecules or to increase the physical stability of the duplex formed between the 
antisense and sense nucleic acids, e.g., phosphorothioate derivatives and acridine 
substituted nucleotides can be used. 

Examples of modified nucleotides that can be used to generate the antisense 
5 nucleic acid include: 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, 
hypoxanthine, xanthine, 4-acetylcytosine, 5-(carboxyhydroxylmethyl) uracil, 
5-carboxymethylaminomethyl-2-thiouridine, 5-carboxymethylaminomethyluracil, 
dihydrouracil, beta-D-galactosylqueosine, inosine, N6-isopentenyladenine, 

1- methylguanine, 1-methylinosine, 2,2-dimethylguanine, 2-methyladenine, 

10 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 7-methylguanine, 
5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, 
beta-D-mannosylqueosine, 5'-methoxycarboxymethyluracil, 5-methoxyuracil, 

2- methylthio-N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, 
pseudouracil, queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 

15 5-methyluracil, uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 
5-methyl-2-thiouracil, 3-(3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 
2,6-diaminopurine. Alternatively, the antisense nucleic acid can be produced 
biologically using an expression vector into which a nucleic acid has been subcloned in 
an antisense orientation (i.e., RNA transcribed from the inserted nucleic acid will be of 

20 an antisense orientation to a target nucleic acid of interest, described further in the 
following subsection). 

The antisense nucleic acid molecules of the invention are generated in situ such 
that they hybridize with or bind to cellular mRNA and/or genomic DNA encoding a CPP 
protein to thereby inhibit expression of the protein, e.g., by inhibiting transcription 

25 and/or translation. The hybridization can be by conventional nucleotide 

complementarity to form a stable duplex, or, for example, in the case of an antisense 
nucleic acid molecule that binds to DNA duplexes, through specific interactions in the 
major groove of the double helix. An example of a route of administration of antisense 
nucleic acid molecules of the invention includes direct injection at a tissue site. 

30 Alternatively, antisense nucleic acid molecules can be modified to target selected cells 
and then administered systemically. For example, for systemic administration, antisense 
molecules can be modified such that they specifically bind to receptors or antigens 
expressed on a selected cell surface, e.g., by linking the antisense nucleic acid molecules 
to peptides or antibodies that bind to cell surface receptors or antigens. The antisense 
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nucleic acid molecules can also be delivered to cells using the vectors described herein. 
To achieve sufficient intracellular concentrations of antisense molecules, vector 
constructs in which the antisense nucleic acid molecule is placed under the control of a 
strong pol II or pol III promoter are preferred. 
5 In yet another embodiment, the antisense nucleic acid molecule of the invention 

is an ot-anomeric nucleic acid molecule. An a-anomeric nucleic acid molecule forms 
specific double-stranded hybrids with complementary RNA in which, contrary to the 
usual P-units, the strands run parallel to each other (Gaultier et al. (1987) Nucleic Acids 
Res 15: 6625-6641). The antisense nucleic acid molecule can also comprise a 

10 2-o-methylribonucleotide (Inoue et al (1987) Nucleic Acids Res 15: 6131-6148) or a 
chimeric RNA -DNA analogue (Inoue et al (1987) FEES Lett 215: 327-330). 

Such modifications include, by way of nonlimiting example, modified bases, and 
nucleic acids whose sugar phosphate backbones are modified or derivatized. These 
modifications are carried out at least in part to enhance the chemical stability of the 

15 modified nucleic acid, such that they may be used, for example, as antisense binding 
nucleic acids in applications. 

Double Stranded RNA Inhibition (RNAi) by Hairpin Nucleic Acids 

Another aspect of the invention pertains to the use of post transcriptional gene 
20 silencing (PTGS) to repress gene expression. Double stranded RNA can initiate the 

sequence specific repression of gene expression in plants and animals. Double stranded 
RNA is processed to short duplex oligomers of 21-23 nucleotides in length. These small 
interfering RNA's suppress the expression of endogenous and heterologous genes in a 
sequence specific manner (Fire et al. Nature 391 :806-81 1, Carthew, Curr. Opin. in Cell 
25 Biol., 13:244-248, Elbashir et al., Nature 41 1 :494-498). A RNAi suppressing construct 
can be designed in a number of ways, for example, transcription of a inverted repeat 
which can form a long hair pin molecule, inverted repeats separated by a spacer sequence 
that could be an unrelated sequence such as GUS or an intron sequence. Transcription of 
sense and antisense strands by opposing promoters or cotranscription of sense and 
30 antisense genes. 

CPP Ribozymes and PNA moieties 
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In still another embodiment, an antisense nucleic acid of the invention is a 
ribozyme. Ribozymes are catalytic RNA molecules with ribonuclease activity that are 
capable of cleaving a single-stranded nucleic acid, such as a mRNA, to which they have 
a complementary region. Thus, ribozymes (e.g., hammerhead ribozymes (described in 
5 Haselhoff and Gerlach (1988) Nature 334:585-591)) can be used to catalytically cleave 
CPP mRNA transcripts to thereby inhibit translation of CPP mRNA. A ribozyme having 
specificity for a CPP-encoding nucleic acid can be designed based upon the nucleotide 
sequence of a CPP DNA disclosed herein (i.e., SEQ ID NO: 1, SEQ ID NO:14 or SEQ 
ID NO: 1 7). For example, a derivative of a Tetrahymena L-l 9 IVS RNA can be 

10 constructed in which the nucleotide sequence of the active site is complementary to the 
nucleotide sequence to be cleaved in a CPP-encoding mRNA. See, e.g., Cech et aL U.S. 
Pat No. 4,987,071; and Cech et aL U.S. Pat. No. 5,1 16,742. Alternatively, CPP mRNA 
can be used to select a catalytic RNA having a specific ribonuclease activity from a pool 
of RNA molecules. See, e.g., Bartel et aL, (1993) Science 261:1411-1418. 

15 Alternatively, CPP gene expression can be inhibited by targeting nucleotide 

sequences complementary to the regulatory region of the CPP (e.g., the CPP promoter 
and/or enhancers) to form triple helical structures that prevent transcription of the CPP 
gene in target cells. See generally, Helene. (1991) Anticancer Drug Des. 6: 569-84; 
Helene. et aL (1992) Ann. N. Y. Acad. Set 660:27-36; and Maher (1992) Bioassays 14: 

20 807-15. 

In various embodiments, the nucleic acids of CPP can be modified at the base 
moiety, sugar moiety or phosphate backbone to improve, e.g., the stability, hybridization, 
or solubility of the molecule. For example, the deoxyribose phosphate backbone of the 
nucleic acids can be modified to generate peptide nucleic acids (see Hyrup et aL (1996) 

25 Bioorg Med Chem 4: 5-23). As used herein, the terms "peptide nucleic acids" or "PNAs" 
refer to nucleic acid mimics, e.g., DNA mimics, in which the deoxyribose phosphate 
backbone is replaced by a pseudopeptide backbone and only the four natural nucleobases 
are retained. The neutral backbone of PNAs has been shown to allow for specific 
hybridization to DNA and RNA under conditions of low ionic strength. The synthesis of 

30 PNA oligomers can be performed using standard solid phase peptide synthesis protocols 
as described in Hyrup et aL (1996) above; Perry-O'Keefe et aL (1996) PNAS 93: 
14670-675. 

PNAs of CPP can be used in therapeutic and diagnostic applications. For 
example, PNAs can be used as antisense or antigene agents for sequence-specific 
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modulation of gene expression by, e.g., inducing transcription or translation arrest or 
inhibiting replication. PNAs of CPP can also be used, e.g., in the analysis of single base 
pair mutations in a gene by, e.g., PNA directed PCR clamping; as artificial restriction 
enzymes when used in combination with other enzymes, e.g., SI nucleases (Hyrup B. 
5 (1996) above); or as probes or primers for DNA sequence and hybridization (Hyrup et al. 
(1996), above; Perry-O'Keefe (1996), above). 

In another embodiment, PNAs of CPP can be modified, e.g., to enhance their 
stability or cellular uptake, by attaching lipophilic or other helper groups to PNA, by the 
formation of PNA-DNA chimeras, or by the use of liposomes or other techniques of drug 

10 delivery known in the art. For example, PNA-DNA chimeras of CPP can be generated 
that may combine the advantageous properties of PNA and DNA. Such chimeras allow 
DNA recognition enzymes, e.g., RNase H and DNA polymerases, to interact with the 
DNA portion while the PNA portion would provide high binding affinity and specificity. 
PNA-DNA chimeras can be linked using linkers of appropriate lengths selected in terms 

15 of base stacking, number of bonds between the nucleobases, and orientation (Hyrup 

(1996) above). The synthesis of PNA-DNA chimeras can be performed as described in 
Hyrup (1996) above and Finn et al. (1996) Nucl Acids Res 24: 3357-63. For example, a 
DNA chain can be synthesized on a solid support using standard phosphoramidite 
coupling chemistry, and modified nucleoside analogs, e.g., 5 f -(4-methoxytrityl) 

20 amino-5'-deoxy-thymidine phosphoramidite, can be used between the PNA and the 5' 
end of DNA (Mag et al. (1989) Nucl Acid Res 17: 5973-88). PNA monomers are then 
coupled in a stepwise manner to produce a chimeric molecule with a 5 f PNA segment 
and a 3' DNA segment (Finn et al. (1996) above). Alternatively, chimeric molecules can 
be synthesized with a 5' DNA segment and a 3' PNA segment. See, Petersen et al. 

25 (1975) Bioorg Med Chem Lett 5: 1119-11124. 
CPP Polypeptides 

A CPP polypeptide of the invention includes the protein whose sequence is 
provided in SEQ ID NO: 2, SEQ ID NO:15 or SEQ ID NO:18. The invention also 
includes a mutant or variant protein any of whose residues may be changed from the 
30 corresponding residue shown in SEQ ID NO: 2, SEQ ID NO: 1 5 or SEQ ID NO: 1 8 while 
still encoding a protein that maintains its CPP-like activities and physiological functions, 
or a functional fragment thereof. In some embodiments, up to 20% or more of the 
residues may be so changed in the mutant or variant protein. In some embodiments, the 
CPP polypeptide according to the invention is a mature polypeptide. 
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In general, a CPP -like variant that preserves CPP-like function includes any 
variant in which residues at a particular position in the sequence have been substituted by 
other amino acids, and further include the possibility of inserting an additional residue or 
residues between two residues of the parent protein as well as the possibility of deleting 
5 one or more residues from the parent sequence. Any amino acid substitution, insertion, 
or deletion is encompassed by the invention. In favorable circumstances, the substitution 
is a conservative substitution as defined above. 

One aspect of the invention pertains to isolated CPP proteins, and biologically 
active portions thereof, or derivatives, fragments, analogs or homologs thereof. Also 

10 provided are polypeptide fragments suitable for use as immunogens to raise anti-CPP 
antibodies. In one embodiment, native CPP proteins can be isolated from cells or tissue 
sources by an appropriate purification scheme using standard protein purification 
techniques. In another embodiment, CPP proteins are produced by recombinant DNA 
techniques. Alternative to recombinant expression, a CPP protein or polypeptide can be 

15 synthesized chemically using standard peptide synthesis techniques. 

An "isolated" or "purified" protein or biologically active portion thereof is 
substantially free of cellular material or other contaminating proteins from the cell or 
tissue source from which the CPP protein is derived, or substantially free from chemical 
precursors or other chemicals when chemically synthesized. The language "substantially 

20 free of cellular material" includes preparations of CPP protein in which the protein is 
separated from cellular components of the cells from which it is isolated or 
recombinantly produced. In one embodiment, the language "substantially free of cellular 
material" includes preparations of CPP protein having less than about 30% (by dry 
weight) of non-CPP protein (also referred to herein as a "contaminating protein"), more 

25 preferably less than about 20% of non-CPP protein, still more preferably less than about 
10% of non-CPP protein, and most preferably less than about 5% non-CPP protein. 
When the CPP protein or biologically active portion thereof is recombinantly produced, 
it is also preferably substantially free of culture medium, i.e., culture medium represents 
less than about 20%, more preferably less than about 10%, and most preferably less than 

30 about 5% of the volume of the protein preparation. 

The language "substantially free of chemical precursors or other chemicals" 
includes preparations of CPP protein in which the protein is separated from chemical 
precursors or other chemicals that are involved in the synthesis of the protein. In one 
embodiment, the language "substantially free of chemical precursors or other chemicals" 
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includes preparations of CPP protein having less than about 30% (by dry weight) of 
chemical precursors or non-CPP chemicals, more preferably less than about 20% 
chemical precursors or non-CPP chemicals, still more preferably less than about 10% 
chemical precursors or non-CPP chemicals, and most preferably less than about 5% 
5 chemical precursors or non-CPP chemicals. 

Biologically active portions of a CPP protein include peptides comprising amino 
acid sequences sufficiently homologous to or derived from the amino acid sequence of 
the CPP protein, e.g., the amino acid sequence shown in SEQ ID NO: 2 that include 
fewer amino acids than the full length CPP proteins, and exhibit at least one activity of a 

10 CPP protein, e.g. substrate binding. Typically, biologically active portions comprise a 
domain or motif with at least one activity of the CPP protein. A biologically active 
portion of a CPP protein can be a polypeptide which is, for example, 10, 25, 50, 100 or 
more amino acids in length. 

A biologically active portion of a CPP protein of the present invention may 

15 contain at least one of the above-identified domains conserved between the CPP proteins, 
e.g.. Moreover, other biologically active portions, in which other regions of the protein 
are deleted, can be prepared by recombinant techniques and evaluated for one or more of 
the functional activities of a native CPP protein. 

A biologically active portion or a CPP protein can be the N-terminal domain of 

20 the CPP polypeptide. Alternatively, a biologically active portion or a CPP protein can be 
the C-terminal domain of the CPP polypeptide. Preferably, the biologically active 
portion comprises at least 75 amino acids of the C- terminal domain. More preferably, 
the biologically active portion comprises at least 25 amino acids of the C- terminal 
domain. Most preferably, the biologically active portion comprises at least 10 amino 

25 acids of the C- terminal. 

In an embodiment, the CPP protein has an amino acid sequence shown in SEQ ID 
NO: 2, SEQ ID NO: 1 5 or SEQ ID NO: 1 8. In other embodiments, the CPP protein is 
substantially homologous to SEQ ID NO: 2, SEQ ID NO: 15 or SEQ ID NO: 18 and 
retains the functional activity of the protein of SEQ ID NO: 2, SEQ ID NO: 15 or SEQ ID 

30 NO: 18, yet differs in amino acid sequence due to natural allelic variation or mutagenesis, 
as described in detail below. Accordingly, in another embodiment, the CPP protein is a 
protein that comprises an amino acid sequence at least about 45% homologous to the 
amino acid sequence of S SEQ ID NO: 2, SEQ ID NO: 15 or SEQ ID NO: 18 and retains 
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the functional activity of the CPP proteins of SEQ ID NO: 2, SEQ ID NO: 15 or SEQ ID 
NO:18. 

Exemplary homologous CPP polypeptides include for example the polypeptide 
sequences of SEQ ID NO: 69, 71, 73 and 75. 
5 Determining homology between two or more sequence 

To determine the percent homology of two amino acid sequences or of two 
nucleic acids, the sequences are aligned for optimal comparison purposes (e.g., gaps can 
be introduced in either of the sequences being compared for optimal alignment between 
the sequences). The amino acid residues or nucleotides at corresponding amino acid 
10 positions or nucleotide positions are then compared. When a position in the first 

sequence is occupied by the same amino acid residue or nucleotide as the corresponding 
position in the second sequence, then the molecules are homologous at that position (i.e., 
as used herein amino acid or nucleic acid "homology" is equivalent to amino acid or 
nucleic acid "identity"). 

15 The nucleic acid sequence homology may be determined as the degree of identity 

between two sequences. The homology may be determined using computer programs 
known in the art, such as GAP software provided in the GCG program package. See, 
Needleman and Wunsch 1970 J Mo! Biol 48: 443-453. Using GCG GAP software with 
the following settings for nucleic acid sequence comparison: GAP creation penalty of 5.0 

20 and GAP extension penalty of 0.3, the coding region of the analogous nucleic acid 

sequences referred to above exhibits a degree of identity preferably of at least 70%, 75%, 
80%, 85%, 90%, 95%, 98%, or 99%, with the CDS (encoding) part of the DNA sequence 
shown in SEQ ID NO:l or SEQ ID NO:14 or SEQ ID NO:17. 

The term "sequence identity" refers to the degree to which two polynucleotide or 

25 polypeptide sequences are identical on a residue-by-residue basis over a particular region 
of comparison. The term "percentage of sequence identity" is calculated by comparing 
two optimally aligned sequences over that region of comparison, determining the number 
of positions at which the identical nucleic acid base (e.g., A, T, C, G, U, or I, in the case 
of nucleic acids) occurs in both sequences to yield the number of matched positions, 

30 dividing the number of matched positions by the total number of positions in the region 

of comparison (i.e., the window size), and multiplying the result by 100 to yield the 

percentage of sequence identity. The term "substantial identity" as used herein denotes a 

characteristic of a polynucleotide sequence, wherein the polynucleotide comprises a 

sequence that has at least 80 percent sequence identity, preferably at least 85 percent 
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identity and often 90 to 95 percent sequence identity, more usually at least 99 percent 
sequence identity as compared to a reference sequence over a comparison region. The 
term "percentage of positive residues" is calculated by comparing two optimally aligned 
sequences over that region of comparison, determining the number of positions at which 
5 the identical and conservative amino acid substitutions, as defined above, occur in both 
sequences to yield the number of matched positions, dividing the number of matched 
positions by the total number of positions in the region of comparison (i.e., the window 
size), and multiplying the result by 100 to yield the percentage of positive residues. 



10 Chimeric and fusion proteins 

The invention also provides CPP chimeric or fusion proteins. As used herein, a 
CPP "chimeric protein" or "fusion protein" comprises a CPP polypeptide operatively 
linked to a non-CPP polypeptide. An "CPP polypeptide" refers to a polypeptide having 
an amino acid sequence corresponding to CPP, whereas a "non-CPP polypeptide" refers 

15 to a polypeptide having an amino acid sequence corresponding to a protein that is not 
substantially homologous to the CPP protein, e.g., a protein that is different from the 
CPP protein and that is derived from the same or a different organism. Within a CPP 
fusion protein the CPP polypeptide can correspond to all or a portion of a CPP protein. 
In one embodiment, a CPP fusion protein comprises at least one biologically active 

20 portion of a CPP protein. In another embodiment, a CPP fusion protein comprises at 
least two biologically active portions of a CPP protein. Within the fusion protein, the 
term "operatively linked" is intended to indicate that the CPP polypeptide and the 
non-CPP polypeptide are fused in-frame to each other. The non-CPP polypeptide can be 
fused to the N-terminus or C-terminus of the CPP polypeptide. 

25 A CPP chimeric or fusion protein of the invention can be produced by standard 

recombinant DNA techniques. For example, DNA fragments coding for the different 
polypeptide sequences are ligated together in-frame in accordance with conventional 
techniques, e.g., by employing blunt-ended or stagger-ended termini for ligation, 
restriction enzyme digestion to provide for appropriate termini, filling-in of cohesive 

30 ends as appropriate, alkaline phosphatase treatment to avoid undesirable joining, and 
enzymatic ligation. In another embodiment, the fusion gene can be synthesized by 
conventional techniques including automated DNA synthesizers. Alternatively, PCR 
amplification of gene fragments can be carried out using anchor primers that give rise to 
complementary overhangs between two consecutive gene fragments that can 
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subsequently be annealed and reamplified to generate a chimeric gene sequence (see, for 
example, Ausubel et al. (eds.) Current Protocols in Molecular Biology, John 
Wiley & Sons, 1992). Moreover, many expression vectors are commercially available 
that already encode a fusion moiety (e.g., a GST polypeptide). A CPP-encoding nucleic 
5 acid can be cloned into such an expression vector such that the fusion moiety is linked 
in-frame to the CPP protein. 



CPP agonists and antagonists 

The present invention also pertains to variants of the CPP proteins that function 
10 as either CPP agonists (mimetics) or as CPP antagonists. Variants of the CPP protein 
can be generated by mutagenesis, e.g., discrete point mutation or truncation of the CPP 
protein. An agonist of the CPP protein can retain substantially the same, or a subset of, 
the biological activities of the naturally occurring form of the CPP protein. An 
antagonist of the CPP protein can inhibit one or more of the activities of the naturally 
15 occurring form of the CPP protein by, for example, competitively binding to a 

downstream or upstream member of a cellular signaling cascade which includes the CPP 
protein. Thus, specific biological effects can be elicited by treatment with a variant of 
limited function. 

Variants of the CPP protein that function as either CPP agonists (mimetics) or as 
20 CPP antagonists can be identified by screening combinatorial libraries of mutants, e.g., 
truncation mutants, of the CPP protein for CPP protein agonist or antagonist activity. In 
one embodiment, a variegated library of CPP variants is generated by combinatorial 
mutagenesis at the nucleic acid level and is encoded by a variegated gene library. A 
variegated library of CPP variants can be produced by, for example, enzymatically 
25 ligating a mixture of synthetic oligonucleotides into gene sequences such that a 

degenerate set of potential CPP sequences is expressible as individual polypeptides, or 
alternatively, as a set of larger fusion proteins (e.g., for phage display) containing the set 
of CPP sequences therein. There are a variety of methods which can be used to produce 
libraries of potential CPP variants from a degenerate oligonucleotide sequence. 
30 Chemical synthesis of a degenerate gene sequence can be performed in an automatic 
DNA synthesizer, and the synthetic gene then ligated into an appropriate expression 
vector. Use of a degenerate set of genes allows for the provision, in one mixture, of all 
of the sequences encoding the desired set of potential CPP sequences. Methods for 
synthesizing degenerate oligonucleotides are known in the art (see, e.g., Narang (1983) 
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Tetrahedron 39:3; Itakura et al. (1984) Annu Rev Biochem 53:323; Itakura et ah (1984) 
Science 198:1056; Ike et al (1983) Nucl Acid Res 11:477. 

Polypeptide libraries 

5 In addition, libraries of fragments of the CPP protein coding sequence can be 

used to generate a variegated population of CPP fragments for screening and subsequent 
selection of variants of a CPP protein. In one embodiment, a library of coding sequence 
fragments can be generated by treating a double stranded PCR fragment of a CPP coding 
sequence with a nuclease under conditions wherein nicking occurs only about once per 

10 molecule, denaturing the double stranded DNA, renaturing the DNA to form double 
stranded DNA that can include sense/antisense pairs from different nicked products, 
removing single stranded portions from reformed duplexes by treatment with SI 
nuclease, and ligating the resulting fragment library into an expression vector. By this 
method, an expression library can be derived which encodes N-terminal and internal 

15 fragments of various sizes of the CPP protein. 

Several techniques are known in the art for screening gene products of 
combinatorial libraries made by point mutations or truncation, and for screening cDNA 
libraries for gene products having a selected property. Such techniques are adaptable for 
rapid screening of the gene libraries generated by the combinatorial mutagenesis of CPP 

20 proteins. The most widely used techniques, which are amenable to high throughput 

analysis, for screening large gene libraries typically include cloning the gene library into 
replicable expression vectors, transforming appropriate cells with the resulting library of 
vectors, and expressing the combinatorial genes under conditions in which detection of a 
desired activity facilitates isolation of the vector encoding the gene whose product was 

25 detected. Recrusive ensemble mutagenesis (REM), a new technique that enhances the 
frequency of functional mutants in the libraries, can be used in combination with the 
screening assays to identify CPP variants (Arkin and Yourvan (1992) PNAS 
89:781 1-7815; Delgrave et al (1993) Protein Engineering 6:327-331). 

30 CPP Antibodies 

CPP polypeptides, including chimeric polypeptides, or derivatives, fragments, 

analogs or homologs thereof, may be utilized as immunogens to generate antibodies that 

immunospecifically-bind these peptide components. Such antibodies include, e.g., 

polyclonal, monoclonal, chimeric, single chain, Fab fragments and a Fab expression 
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library. In a specific embodiment, fragments of the CPP polypeptides are used as 
immunogens for antibody production. Various procedures known within the art may be 
used for the production of polyclonal or monoclonal antibodies to a CPP polypeptides, or 
derivative, fragment, analog or homolog thereof. 
5 For the production of polyclonal antibodies, various host animals may be immunized by 
injection with the native peptide, or a synthetic variant thereof, or a derivative of the 
foregoing. Various adjuvants may be used to increase the immunological response and 
include, but are not limited to, Freund's (complete and incomplete), mineral gels (e.g., 
aluminum hydroxide), surface active substances (e.g., lysolecithin, pluronic polyols, 

10 polyanions, peptides, oil emulsions, dinitrophenol, etc.) and human adjuvants such as 
Bacille Calmette-Guehn and Corynebacterium parvum. 

For preparation of monoclonal antibodies directed towards a CPP polypeptides, 
or derivatives, fragments, analogs or homologs thereof, any technique that provides for 
the production of antibody molecules by continuous cell line culture may be utilized. 

15 Such techniques include, but are not limited to, the hybridoma technique (see, Kohler 
and Milstein, 1975. Nature 256: 495-497); the trioma technique; the human B-cell 
hybridoma technique (see, Kozbor, et al, 1983. Immunol Today 4: 72) and the EBV 
hybridoma technique to produce human monoclonal antibodies (see, Cole, et al, 1985. 
In: Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). Human 

20 monoclonal antibodies may be utilized in the practice of the present invention and may 
be produced by the use of human hybridomas (see, Cote, et al, 1983. Proc Natl Acad Sci 
USA 80: 2026-2030) or by transforming human B-cells with Epstein Barr Virus in vitro 
(see, Cole, et al, 1985. In: Monoclonal Antibodies and Cancer Therapy (Alan R. Liss, 
Inc., pp. 77-96). 

25 According to the invention, techniques can be adapted for the production of 

single-chain antibodies specific to a CPP polypeptides (see, e.g., U.S. Patent No. 
4,946,778). In addition, methodologies can be adapted for the construction of Fab 
expression libraries (see, e.g., Huse, et al, 1989. Science 246: 1275-1281) to allow rapid 
and effective identification of monoclonal Fab fragments with the desired specificity for 

30 a CPP polypeptides or derivatives, fragments, analogs or homologs thereof. Antibody 
fragments that contain the idiotypes to a CPP polypeptides may be produced by 
techniques known in the art including, e.g., (i) an F(ab')2 fragment produced by pepsin 
digestion of an antibody molecule; (//) an Fab fragment generated by reducing the 
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disulfide bridges of an F(ab')2 fragment; (Hi) an Fab fragment generated by the treatment 
of the antibody molecule with papain and a reducing agent and (zv) Fv fragments. 
In one embodiment, methodologies for the screening of antibodies that possess the 
desired specificity include, but are not limited to, enzyme-linked immunosorbent assay 
5 (ELISA) and other immunologically-mediated techniques known within the art. In a 
specific embodiment, selection of antibodies that are specific to a particular domain of a 
CPP polypeptides is facilitated by generation of hybridomas that bind to the fragment of 
a CPP polypeptides possessing such a domain. Antibodies that are specific for a domain 
within a CPP polypeptides, or derivative, fragments, analogs or homologs thereof, are 
10 also provided herein. The anti-CPP polypeptide antibodies may be used in methods 
known within the art relating to the localization and/or quantitation of a CPP 
polypeptide^. g., for use in measuring levels of the peptide within appropriate 
physiological samples, for use in diagnostic methods, for use in imaging the peptide, and 
the like). 

15 

CPP Recombinant Expression Vectors and Host Cells 

Another aspect of the invention pertains to vectors, preferably expression vectors, 
containing a nucleic acid encoding a CPP protein, or derivatives, fragments, analogs or 
homologs thereof. As used herein, the term "vector" refers to a nucleic acid molecule 

20 capable of transporting another nucleic acid to which it has been linked. Exemplary 

expression vector constructs include for example the constructs of SEQ ID NO: 4„ 5, 36, 
37, 39, 40, 441, 42, 44, 45, 47, 48, 50 , 51 and 53. Additional exemplary expression 
vector constructs include contructs comprising CPP anti-sense nucleic acid such as SEQ 
ID NO: 38. 43., 46, 49, 52. One type of vector is a "plasmid", which refers to a circular 

25 double stranded DNA loop into which additional DNA segments can be ligated. Another 
type of vector is a viral vector, wherein additional DNA segments can be ligated into the 
viral genome. Certain vectors are capable of autonomous replication in a host cell into 
which they are introduced (e.g., bacterial vectors having a bacterial origin of replication). 
Other vectors are integrated into the genome of a host cell upon introduction into the host 

30 cell, and thereby are replicated along with the host genome. Moreover, certain vectors 

are capable of directing the expression of genes to which they are operatively-linked. 

Such vectors are referred to herein as "expression vectors". In general, expression 

vectors of utility in recombinant DNA techniques are often in the form of plasmids. In 

the present specification, "plasmid" and "vector" can be used interchangeably as the 
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plasmid is the most commonly used form of vector. However, the invention is intended 
to include such other forms of expression vectors, such as viral vectors or plant 
transformation vectors, binary or otherwise, which serve equivalent functions. 

The recombinant expression vectors of the invention comprise a nucleic acid of 
5 the invention in a form suitable for expression of the nucleic acid in a host cell, which 
means that the recombinant expression vectors include one or more regulatory 
sequences, selected on the basis of the host cells to be used for expression, that is 
operatively-linked to the nucleic acid sequence to be expressed. Within a recombinant 
expression vector, "operably-linked" is intended to mean that the nucleotide sequence of 

10 interest is linked to the regulatory sequence(s) in a manner that allows for expression of 
the nucleotide sequence (e.g., in an in vitro transcription/translation system or in a host 
cell when the vector is introduced into the host cell). 

The term "regulatory sequence" is intended to includes promoters, enhancers and 
other expression control elements (e.g., polyadenylation signals). Such regulatory 

15 sequences are described, for example, in Goeddel, Gene Expression Technology: 

Methods in Enzymology 185, Academic Press, San Diego, Calif. (1990). Regulatory 
sequences include those that direct constitutive expression of a nucleotide sequence in 
many types of host cell and those that direct expression of the nucleotide sequence only 
in certain host cells (e.g., tissue- specific regulatory sequences). Examples of suitable 

20 promoters include for example constitutive promoters, ABA inducible promoters, tissue 
specific promters or guard cell specific promoters. It will be appreciated by those skilled 
in the art that the design of the expression vector can depend on such factors as the 
choice of the host cell to be transformed, the level of expression of protein desired, etc. 
The expression vectors of the invention can be introduced into host cells to thereby 

25 produce proteins or peptides, including fusion proteins or peptides, encoded by nucleic 
acids as described herein (e.g., CPP proteins, mutant forms of CPP proteins, fusion 
proteins, etc.). 

The recombinant expression vectors of the invention can be designed for 
expression of CPP proteins in prokaryotic or eukaryotic cells. For example, CPP 
30 proteins can be expressed in bacterial cells such as Escherichia coli, insect cells (using 
baculovirus expression vectors) yeast cells, plant cells or mammalian cells. Suitable host 
cells are discussed further in Goeddel, Gene Expression Technology: Methods in 
Enzymology 185, Academic Press, San Diego, Calif. (1990). Alternatively, the 
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recombinant expression vector can be transcribed and translated in vitro, for example 
using T7 promoter regulatory sequences and T7 polymerase. 

Expression of proteins in prokaryotes is most often carried out in Escherichia coli 
with vectors containing constitutive or inducible promoters directing the expression of 
5 either fusion or non-fusion proteins. Fusion vectors add a number of amino acids to a 
protein encoded therein, usually to the amino terminus of the recombinant protein. Such 
fusion vectors typically serve three purposes: (/") to increase expression of recombinant 
protein; (it) to increase the solubility of the recombinant protein; and (Hi) to aid in the 
purification of the recombinant protein by acting as a ligand in affinity purification. 

10 Often, in fusion expression vectors, a proteolytic cleavage site is introduced at the 
junction of the fusion moiety and the recombinant protein to enable separation of the 
recombinant protein from the fusion moiety subsequent to purification of the fusion 
protein. Such enzymes, and their cognate recognition sequences, include Factor Xa, 
thrombin and enterokinase. Typical fusion expression vectors include pGEX (Pharmacia 

15 Biotech Inc; Smith and Johnson, 1988. Gene 67: 31-40), pMAL (New England Biolabs, 
Beverly, Mass.) and pRIT5 (Pharmacia, Piscataway, N.J.) that fuse glutathione 
S-transferase (GST), maltose E binding protein, or protein A, respectively, to the target 
recombinant protein. 

Examples of suitable inducible non-fusion E. coli expression vectors include pTrc 

20 (Amrann et al, (1988) Gene 69:301-315) and pET 1 Id (Studier et al. 9 Gene Expression 
Technology: Methods in Enzymology 185, Academic Press, San Diego, Calif. 
(1990) 60-89). 

One strategy to maximize recombinant protein expression in E. coli is to express 
the protein in a host bacteria with an impaired capacity to proteolytically cleave the 

25 recombinant protein. See, e.g., Gottesman, Gene Expression Technology: Methods 
in Enzymology 185, Academic Press, San Diego, Calif. (1990) 1 19-128. Another 
strategy is to alter the nucleic acid sequence of the nucleic acid to be inserted into an 
expression vector so that the individual codons for each amino acid are those 
preferentially utilized in E. coli (see, e.g., Wada, et al. 9 1992. Nucl. Acids Res. 20: 

30 2111-2118). Such alteration of nucleic acid sequences of the invention can be carried 
out by standard DNA synthesis techniques. 

In another embodiment, the CPP expression vector is a yeast expression vector. 
Examples of vectors for expression in yeast Saccharomyces cerivisae include pYepSecl 
(Baldari, et aL, 1987. EMBOJ. 6: 229-234), pMFa (Kurjan and Herskowitz, 1982. Cell 



WO 03/012116 PCT/IB02/03887 

30: 933-943), pJRY88 (Schultz et al, 1987. Gene 54: 113-123), pYES2 (Invitrogen 
Corporation, San Diego, Calif.), and picZ (InVitrogen Corp, San Diego, Calif.). 
Alternatively, CPP can be expressed in insect cells using baculovirus expression vectors. 
Baculovirus vectors available for expression of proteins in cultured insect cells (e.g., SF9 
5 cells) include the pAc series (Smith, et a/., 1983. Mol. Cell. Biol. 3: 2156-2165) and the 
pVL series (Lucklow and Summers, 1989. Virology 170: 31-39). 
In yet another embodiment, a nucleic acid of the invention is expressed in mammalian 
cells using a mammalian expression vector. Examples of mammalian expression vectors 
include pCDM8 (Seed, 1987. Nature 329: 840) and pMT2PC (Kaufman, et a/., 1987. 

10 EMBOJ. 6: 187-195). When used in mammalian cells, the expression vector's control 
functions are often provided by viral regulatory elements. For example, commonly used 
promoters are derived from polyoma, adenovirus 2, cytomegalovirus, and simian virus 
40. For other suitable expression systems for both prokaryotic and eukaryotic cells see, 
e.g., Chapters 16 and 17 of Sambrook, et al, Molecular Cloning: A Laboratory 

15 Manual. 2nd ed., Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, N.Y., 1989. 

In yet another embodiment, a nucleic acid of the invention is expressed in plants 
cells using a plant expression vector. Examples of plant expression vectors systems 
include tumor inducing (Ti) plasmid or portion thereof found in Agrobacterium, 

20 cauliflower mosaic virus (CAMV) DNA and vectors such as pBI121 . 

For expression in plants, the recombinant expression cassette will contain in 
addition to the CPP nucleic acids, a plant promoter region, a transcription initiation site 
(if the coding sequence to transcribed lacks one), and a transcription 
termination/polyadenylation sequence. The termination/polyadenylation region may be 

25 obtained from the same gene as the promoter sequence or may be obtained from different 
genes. Unique restriction enzyme sites at the 5* and 3' ends of the cassette are typically 
included to allow for easy insertion into a pre-existing vector. 

Examples of suitable promotors include promoters from plant viruses such as the 35S 
promoter from cauliflower mosaic virus (CaMV). Odell, et al., Nature, 313: 810-812 
30 (1985). and promoters from genes such as rice actin (McElroy, et al., Plant Cell, 163-171 
(1990)); ubiquitin (Christensen, et al., Plant Mol. Biol., 12: 619-632 (1992); and 
Christensen, et al., Plant Mol. Biol., 18: 675-689 (1992)); pEMU (Last, et al., Theor. 
Appl. Genet., 81: 581-588 (1991)); MAS (Velten, et al., EMBO J., 3: 2723-2730 (1984)); 
maize H3 histone (Lepetit, et al., Mol. Gen. Genet., 231: 276-285 (1992); and 



WO 03/012116 PCT/IB02/03887 

Atanassvoa, et al., Plant Journal, 2(3): 291-300 (1992)), the 5 - or 3 f -promoter derived 
from T-DNA of Agrobacterium tumefaciens, the Smas promoter, the cinnamyl alcohol 
dehydrogenase promoter (U.S. Pat. No. 5,683,439), the Nos promoter, the rubisco 
promoter, the GRP1-8 promoter, ALS promoter, (WO 96/30530), a synthetic promoter, 
5 such as, Rsyn7, SCP and UCP promoters, ribulose- 1,3 -diphosphate carboxylase, 
fruit-specific promoters, heat shock promoters, seed-specific promoters and other 
transcription initiation regions from various plant genes, for example, include the various 
opine initiation regions, such as for example, octopine, mannopine, and nopaline. 
Additional regulatory elements that may be connected to a CPP encoding nucleic acid 

10 sequence for expression in plant cells include terminators, polyadenylation sequences, 
and nucleic acid sequences encoding signal peptides that permit localization within a 
plant cell or secretion of the protein from the cell. Such regulatory elements and 
methods for adding or exchanging these elements with the regulatory elements CPP gene 
are known, and include, but are not limited to, 3' termination and/or polyadenylation 

15 regions such as those of the Agrobacterium tumefaciens nopaline synthase (nos) gene 
(Bevan, et al., Nucl. Acids Res., 12: 369-385 (1983)); the potato proteinase inhibitor II 
(PINII) gene (Keil, et al., Nucl. Acids Res., 14: 5641-5650 (1986) and hereby 
incorporated by reference); and An,, et al., Plant Cell, 1 : 1 15-122 (1989)); and the CaMV 
19S gene (Mogen, et al., Plant Cell, 2: 1261-1272 (1990)). 

20 Plant signal sequences, including, but not limited to, signal-peptide encoding 

DNA/RNA sequences which target proteins to the extracellular matrix of the plant cell 
(Dratewka-Kos, et al., J. Biol. Chem., 264: 4896-4900 (1989)) and the Nicotiana 
plumbaginifolia extension gene (DeLoose, et al., Gene, 99: 95-100 (1991)), or signal 
peptides which target proteins to the vacuole like the sweet potato sporamin gene 

25 (Matsuka, et al., Proc. Nat'l Acad. Sci. (USA), 88: 834 (1991)) and the barley lectin gene 
(Wilkins, et al., Plant Cell, 2: 301-313 (1990)), or signals which cause proteins to be 
secreted such as that of PRIb (Lind, et al., Plant Mol. Biol., 18: 47-53 (1992)), or those 
which target proteins to the plastids such as that of rapeseed enoyl-ACP reductase 
(Verwaert, et al., Plant Mol. Biol., 26: 189-202 (1994)) are useful in the invention. 

30 In another embodiment, the recombinant expression vector is capable of directing 

expression of the nucleic acid preferentially in a particular cell type {e.g., tissue-specific 
regulatory elements are used to express the nucleic acid). Tissue-specific regulatory 
elements are known in the art. Especially useful in connection with the nucleic acids of 
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the present invention are expression systems which are operable in plants. These include 
systems which are under control of a tissue-specific promoter, as well as those which 
involve promoters that are operable in all plant tissues. 

Organ-specific promoters are also well known. For example, the patatin class I 
5 promoter is transcriptionally activated only in the potato tuber and can be used to target 
gene expression in the tuber (Bevan, M., 1986, Nucleic Acids Research 14:4625-4636). 
Another potato-specific promoter is the granule-bound starch synthase (GBSS) promoter 
(Visser, R.G.R, et al, 1991, Plant Molecular Biology 17:691-699). 
Other organ-specific promoters appropriate for a desired target organ can be isolated 

10 using known procedures. These control sequences are generally associated with genes 
uniquely expressed in the desired organ. In a typical higher plant, each organ has 
thousands of mRNAs that are absent from other organ systems (reviewed in Goldberg, 
P., 1986, Trans. R. Soc. London B314:343). 

For in situ production of the antisense mRNA of GST, those regions of the GST 

15 gene which are transcribed into GST mRNA, including the untranslated regions thereof, 
are inserted into the expression vector under control of the promoter system in a reverse 
orientation. The resulting transcribed mRNA is then complementary to that normally 
produced by the plant. 

The resulting expression system or cassette is ligated into or otherwise 

20 constructed to be included in a recombinant vector which is appropriate for plant 
transformation. The vector may also contain a selectable marker gene by which 
transformed plant cells can be identified in culture. Usually, the marker gene will encode 
antibiotic resistance. These markers include resistance to G418, hygromycin, bleomycin, 
kanamycin, and gentamicin. After transforming the plant cells, those cells having the 

25 vector will be identified by their ability to grow on a medium containing the particular 
antibiotic. Replication sequences, of bacterial or viral origin, are generally also included 
to allow the vector to be cloned in a bacterial or phage host, preferably a broad host 
range prokaryotic origin of replication is included. A selectable marker for bacteria 
should also be included to allow selection of bacterial cells bearing the desired construct. 

30 Suitable prokaryotic selectable markers also include resistance to antibiotics such as 
kanamycin or tetracycline. 

Other DNA sequences encoding additional functions may also be present in the 
vector, as is known in the art. For instance, in the case of Agrobacterium 
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transformations, T-DNA sequences will also be included for subsequent transfer to plant 
chromosomes. 

Another aspect of the invention pertains to host cells into which a recombinant 
expression vector of the invention has been introduced. The terms "host cell" and 
5 "recombinant host cell" are used interchangeably herein. It is understood that such terms 
refer not only to the particular subject cell but also to the progeny or potential progeny of 
such a cell. Because certain modifications may occur in succeeding generations due to 
either mutation or environmental influences, such progeny may not, in fact, be identical 
to the parent cell, but are still included within the scope of the term as used herein. 

10 Vector DNA can be introduced into prokaryotic or eukaryotic cells via conventional 
transformation or transfection techniques. As used herein, the terms "transformation" 
and "transfection" are intended to refer to a variety of art-recognized techniques for 
introducing foreign nucleic acid (e.g., DNA) into a host cell. 

A host cell of the invention, such as a prokaryotic or eukaryotic host cell in culture, 

1 5 can be used to produce (/. e. , express) a polypeptide of the invention encoded in a an open 
reading frame of a polynucleotide of the invention. Accordingly, the invention further 
provides methods for producing a polypeptide using the host cells of the invention. In 
one embodiment, the method comprises culturing the host cell of invention (into which a 
recombinant expression vector encoding a polypeptide of the invention has been 

20 introduced) in a suitable medium such that the polypeptide is produced. In another 

embodiment, the method further comprises isolating the polypeptide from the medium or 
the host cell. 

A number of types of cells may act as suitable host cells for expression of a 
polypeptide encoded by an open reading frame in a polynucleotide of the invention. 

25 Plant host cells include, for example, plant cells that could function as suitable hosts for 
the expression of a polynucleotide of the invention include epidermal cells, mesophyll 
and other ground tissues, and vascular tissues in leaves, stems, floral organs, and roots 
from a variety of plant species, such as Arabidopsis thaliana, Nicotiana tabacum, 
Brassica napus, Zea mays, Oryza sativa, Gossypium hirsutum and Glycine max, 

30 Alternatively, it may be possible to produce a polypeptide in lower eukaryotes such 

as yeast or in prokaryotes such as bacteria. Potentially suitable yeast strains include 
Saccharomyces cerevisiae, Schizosaccharomyces pombe, Kluyveromyces strains, 
Candida, or any yeast strain capable of expressing heterologous proteins. Potentially 
suitable bacterial strains include Escherichia coli, Bacillus subtilis, Salmonella 
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typhimurium, or any bacterial strain capable of expressing heterologous polypeptides. If 
the polypeptide is made in yeast or bacteria, it may be necessary to modify the 
polypeptide produced therein, for example by phosphorylation or glycosylation of the 
appropriate sites, in order to obtain a functional polypeptide, if the polypeptide is of 
5 sufficient length and conformation to have activity. Such covalent attachments may be 
accomplished using known chemical or enzymatic methods. 

A polypeptide may be prepared by culturing transformed host cells under culture 
conditions suitable to express the recombinant protein. The resulting expressed 
polypeptide or protein may then be purified from such culture (e.g., from culture medium 

10 or cell extracts) using known purification processes, such as gel filtration and ion 
exchange chromatography. The purification of the polypeptide or protein may also 
include an affinity column containing agents which will bind to the protein; one or more 
column steps over such affinity resins as concanavalin A-agarose, heparin-toyopearl® or 
Cibacrom blue 3GA Sepharose®; one or more steps involving hydrophobic interaction 

1 5 chromatography using such resins as phenyl ether, butyl ether, or propyl ether; or 
immunoaffinity chromatography. 

Alternatively, a polypeptide or protein may also be expressed in a form which 
will facilitate purification. For example, it may be expressed as a fusion protein 
containing a six-residue histidine tag. The histidine-tagged protein will then bind to a 

20 Ni-affinity column. After elution of all other proteins, the histidine-tagged protein can 
be eluted to achieve rapid and efficient purification. One or more reverse-phase high 
performance liquid chromatography (RP-HPLC) steps employing hydrophobic RP- 
HPLC media, e.g., silica gel having pendant methyl or other aliphatic groups, can be 
employed to further purify a polypeptide. Some or all of the foregoing purification steps, 

25 in various combinations, can also be employed to provide a substantially homogeneous 
isolated recombinant polypeptide. The protein or polypeptide thus purified is 
substantially free of other plant proteins or polypeptides and is defined in accordance 
with the present invention as "isolated." 

Transformed Plants Cells and Transgenic Plants 

30 The invention includes protoplast, plants cells, plant tissue and plants (e.g., 

monocots and dicots transformed with a CPP nucleic acid (Le, sense or antisense), a 
vector containing a CPP nucleic acid (/. e, sense or antisense)or an expression vector 
containing a CPP nucleic acid (i.e, sense or antisense). As used herein, "plant" is meant 
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to include not only a whole plant but also a portion thereof (i.e., cells, and tissues, 
including for example, leaves, stems, shoots, roots, flowers, fruits and seeds). 

The plant can be any plant type including, for example, species from the genera 
Cucurbita, Rosa, Vitis, Juglans, Fragaria, Lotus, Medicago, Onobrychis, Trifolium, 
5 Trigonella, Vigna, Citrus, Linum, Geranium, Manihot, Daucus, Arabidopsis, Brassica, 
Raphanus, Sinapis, Atropa, Capsicum, Datura, Hyoscyamus, Lycopersicon, Nicotiana, 
Solanum, Petunia, Digitalis, Majorana, Ciahorium, Helianthus, Lactuca, Bromus, 
Asparagus, Antirrhinum, Heterocallis, Nemesis, Pelargonium, Panieum, Pennisetum, 
Ranunculus, Senecio, Salpiglossis, Cucumis, Browaalia, Glycine, Pisum, Phaseolus, 
10 Lolium, Oryza, Zea, Avena, Hordeum, Secale, Triticum, Sorghum, Gossypium, Picea, 
Caco, and Populus. 

In some aspects of the invention, the transformed plant is resistant to biotic and 
abiotic stresses, e.g., chilling stress, salt stress, water stress (e.g., drought), disease, 
grazing pests and wound healing. Additionally, the invention also includes a transgenic 

15 plant that is resistant to pathogens such as for example fungi, bacteria, nematodes, 

viruses and parasitic weeds. Alternatively, the transgenic plant is resistant to herbicides 
or has delayed senesence. The transgenic plant has an increase in yield, productivity, 
biomass or ABA sensitivity. By resistant is meant the plant grows under stress 
conditions (e.g., high salt, decreased water, low temperatures) or under conditions that 

20 normally inhibit, to some degree, the growth of an untransformed plant. Methodologies 
to determine plant growth or response to stress include for example, height 
measurements, weight meaurements, leaf area, ability to flower, water use, transpiration 
rates and yield. 

The invention also includes cells, tissues, including for example, leaves, stems, 
25 shoots, roots, flowers, fruits and seeds and the progeny derived from the tranformed 
plant. 

Numerous methods for introducing foreign genes into plants are known and can 
be used to insert a gene into a plant host, including biological and physical plant 
transformation protocols. See, for example, Miki et al., (1993) "Procedure for 
30 Introducing Foreign DNA into Plants", In: Methods in Plant Molecular Biology and 
Biotechnology, Glick and Thompson, eds., CRC Press, Inc., Boca Raton, pages 67-88 
and Andrew Bent in, Clough SJ and Bent AF, 1998. Floral dipping: a simplified method 
for Agrobacterium-mediatcd transformation of Arabidopsis thaliana.. The methods 



42 



WO 03/012116 PCT/IB02/03887 

chosen vary with the host plant, and include chemical transfection methods such as 
calcium phosphate, polyethylene glycol (PEG) transformation, microorganism-mediated 
gene transfer such as Agrobacterium (Horsch, et al., Science, 227: 1229-31 (1985)), 
electroporation, protoplast transformation, micro-injection, flower dipping and biolistic 
5 bombardment. 



Agrobacterium-mediated Transformation 

The most widely utilized method for introducing an expression vector into plants 
is based on the natural transformation system of Agrobacterium. A. tumefaciens and A. 

10 rhizogenes are plant pathogenic soil bacteria which genetically transform plant cells. 
The Ti and Ri plasmids of A. tumefaciens and A. rhizogenes, respectfully, carry genes 
responsible for genetic transformation of plants. See, for example, Kado, Crit. Rev. 
Plant Sci., 10: 1-32 (1991). Descriptions of the Agrobacterium vector systems and 
methods for Agrobacterium-mediated gene transfer are provided in Gruber et al., supra; 

15 and Moloney, et al, Plant Cell Reports, 8: 238-242 (1989). 

Transgenic Arabidopsis plants can be produced easily by the method of dipping 
flowering plants into an Agrobacterium culture, based on the method of Andrew Bent in, 
Clough SJ and Bent AF, 1998. Floral dipping: a simplified method for Agrobacterium- 
mediated transformation of Arabidopsis thaliana. Wild type plants are grown until the 
20 plant has both developing flowers and open flowers. The plant are inverted for 1 minute 
into a solution of Agrobacterium culture carrying the appropriate gene construct. Plants 
are then left horizontal in a tray and kept covered for two days to maintain humidity and 
then righted and bagged to continue growth and seed development. Mature seed is bulk 
harvested. 

25 Direct Gene Transfer 

A generally applicable method of plant transformation is microprojectile- 
mediated transformation, where DNA is carried on the surface of microprojectiles 
measuring about 1 to 4 mu.m. The expression vector is introduced into plant tissues with 
a biolistic device that accelerates the microprojectiles to speeds of 300 to 600 m/s which 
30 is sufficient to penetrate the plant cell walls and membranes. (Sanford, et al., Part. Sci. 
Technol., 5: 27-37 (1987); Sanford, Trends Biotech, 6: 299-302 (1988); Sanford, 
Physiol. Plant, 79: 206-209 (1990); Klein, et al., Biotechnology, 10: 286-291 (1992)). 
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Another method for physical delivery of DNA to plants is sonication of target cells as 
described in Zang, et al., BioTechnology, 9: 996-996 (1991). Alternatively, liposome or 
spheroplast fusions have been used to introduce expression vectors into plants. See, for 
example, Deshayes, et al, EMBO J., 4: 2731-2737 (1985); and Christou, et al., Proc. 

5 Natl Acad. Sci. (USA), 84: 3962-3966 (1987). Direct uptake of DNA into protoplasts 
using CaCl.sub.2 precipitation, polyvinyl alcohol or poly-L-ornithine have also been 
reported. See, for example, Hain, et al., Mol. Gen. Genet., 199: 161 (1985); and Draper, 
et al., Plant Cell Physiol., 23: 451-458 (1982). 

Electroporation of protoplasts and whole cells and tissues has also been 

10 described. See, for example, Donn, et al., (1990) In: Abstracts of the Vllth Int;l. 

Congress on Plant Cell and Tissue Culture IAPTC, A2-38, page 53; D'Halluin et al., 
Plant Cell, 4: 1495-1505 (1992); and Spencer et al., Plant Mol. Biol., 24: 51-61 (1994). 



Particle Wounding/Agrobacterium Delivery 

15 Another useful basic transformation protocol involves a combination of 

wounding by particle bombardment, followed by use of Agrobacterium for DNA 
delivery, as described by Bidney, et al., Plant Mol. Biol., 18: 301-31 (1992). Useful 
plasmids for plant transformation include Bin 19. See Bevan, Nucleic Acids Research, 
12: 871 1-8721 (1984), and hereby incorporated by reference. 

20 In general, the intact meristem transformation method involves imbibing seed for 

24 hours in the dark, removing the cotyledons and root radical, followed by culturing of 
the meristem explants. Twenty-four hours later, the primary leaves are removed to 
expose the apical meristem. The explants are placed apical dome side up and bombarded, 
e.g., twice with particles, followed by co-cultivation with Agrobacterium. To start the 

25 co-cultivation for intact meristems, Agrobacterium is placed on the meristem. After 

about a 3-day co-cultivation period the meristems are transferred to culture medium with 
cefotaxime plus kanamycin for the NPTII selection. 

The split meristem method involves imbibing seed, breaking of the cotyledons to 
produce a clean fracture at the plane of the embryonic axis, excising the root tip and then 

30 bisecting the explants longitudinally between the primordial leaves. The two halves are 
placed cut surface up on the medium then bombarded twice with particles, followed by 
co-cultivation with Agrobacterium. For split meristems, after bombardment, the 
meristems are placed in an Agrobacterium suspension for 30 minutes. They are then 
removed from the suspension onto solid culture medium for three day co-cultivation. 
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After this period, the meristems are transferred to fresh medium with cefotaxime plus 
kanamycin for selection. 



Transfer by Plant Breeding 

5 Alternatively, once a single transformed plant has been obtained by the foregoing 

recombinant DNA method, conventional plant breeding methods can be used to transfer 
the gene and associated regulatory sequences via crossing and backcrossing. Such 
intermediate methods will comprise the further steps of: (1) sexually crossing the 
transgenic plant with a plant from a second taxon; (2) recovering reproductive material 

10 from the progeny of the cross; and (3) growing transgenic plants from the reproductive 
material. Where desirable or necessary, the agronomic characteristics of the second taxon 
can be substantially preserved by expanding this method to include the further steps of 
repetitively: (1) backcrossing the transgenic progeny with non-transgenic plants from the 
second taxon; and (2) selecting for expression of an associated marker gene among the 

15 progeny of the backcross, until the desired percentage of the characteristics of the second 
taxon are present in the progeny along with the gene or genes imparting marker gene 
trait. 

By the term "taxon" herein is meant a unit of botanical classification. It thus includes, 
genus, species, cultivars, varieties, variants and other minor taxonomic groups which 

20 lack a consistent nomenclature. 
Regeneration of Transformants 

The development or regeneration of plants from either single plant protoplasts or 
various explants is well known in the art (Weissbach and Weissbach, 1988). This 
regeneration and growth process typically includes the steps of selection of transformed 

25 cells, culturing those individualized cells through the usual stages of embryonic 
development through the rooted plantlet stage. Transgenic embryos and seeds are 
similarly regenerated. The resulting transgenic rooted shoots are thereafter planted in an 
appropriate plant growth medium such as soil. 

The development or regeneration of plants containing the foreign, exogenous 

30 gene that encodes a polypeptide of interest introduced by Agrobacterium from leaf 

explants can be achieved by methods well known in the art such as described (Horsch et 
al., 1985). In this procedure, transformants are cultured in the presence of a selection 
agent and in a medium that induces the regeneration of shoots in the plant strain being 
transformed as described (Fraley et al., 1983). In particular, U.S. Pat. No. 5,349,124 
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(specification incorporated herein by reference) details the creation of genetically 
transformed lettuce cells and plants resulting therefrom which express hybrid crystal 
proteins conferring insecticidal activity against Lepidopteran larvae to such plants. 

This procedure typically produces shoots within two to four months and those 
5 shoots are then transferred to an appropriate root-inducing medium containing the 
selective agent and an antibiotic to prevent bacterial growth. Shoots that rooted in the 
presence of the selective agent to form plantlets are then transplanted to soil or other 
media to allow the production of roots. These procedures vary depending upon the 
particular plant strain employed, such variations being well known in the art. 

10 Preferably, the regenerated plants are self-pollinated to provide homozygous 

transgenic plants, or pollen obtained from the regenerated plants is crossed to seed- 
grown plants of agronomically important, preferably inbred lines. Conversely, pollen 
from plants of those important lines is used to pollinate regenerated plants. A transgenic 
plant of the present invention containing a desired polypeptide is cultivated using 

15 methods well known to one skilled in the art. 

A preferred transgenic plant is an independent segregant and can transmit the 
CPP gene and its activity to its progeny. A more preferred transgenic plant is 
homozygous for the gene, and transmits that gene to all of its offspring on sexual mating. 
Seed from a transgenic plant may be grown in the field or greenhouse, and resulting 
20 sexually mature transgenic plants are self-pollinated to generate true breeding plants. The 
progeny from these plants become true breeding lines that are evaluated for increased 
expression of the CPP transgene. 

Method of Producing Transgenic Plants 

Also included in the invention are methods of producing a transgenic plant. The 
25 method includes introducing into one or more plant cells a compound that alters CaaX 
prenyl protease expression or activity in the plant to generate a transgenic plant cell and 
regenerating a transgenic plant from the transgenic cell. In some aspects the compound 
increases alters CaaX prenyl protease expression or activity. Alternatively, the 
compound decrease alters CaaX prenyl protease expression or activity. The compound 
30 can be, e.g., (i) a CaaX prenyl protease polypeptide; (ii) a nucleic acid encoding a CaaX 
prenyl protease polypeptide; (iii) a nucleic acid that increases expression of a nucleic 
acid that encodes a CaaX prenyl protease polypeptide ; (iv) a nucleic acid that decreases 
the expression of a nucleic acid that encodes a CaaX prenyl protease polypeptide; (v) a 
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CaaX prenyl protease antisense nucleic acid and derivatives, fragments, analogs and 
homologs thereof. A nucleic acid that increases expression of a nucleic acid that encodes 
a CaaX prenyl protease polypeptide includes, e.g., promoters, enhancers. The nucleic 
acid can be either endogenous or exogenous. Preferably, the compound is a CaaX prenyl 
5 protease polypeptide or a nucleic acid encoding a CaaX prenyl protease polypeptide. For 
example the compound comprises the nucleic acid sequence of SEQ ID NO:l, 14, or 17 
or fragement thereof. Alternatively, the compound is a CaaX prenyl protease antisence 
nucleic acid. For example the compound comprises the nucleic acid sequence of SEQ ID 
NO: 16, 19 or 20. 

10 In various aspects the transgenic plant has an altered phenotype as compared to a 

wild type plant (i.e. , untransformed). By altered phenotype is meant that the plant has a 
one or more characteristic that is different from the wild type plant. For example, the 
transgenic plant has an increased resistence to stress. Increased stress resistance is meant 
that the transgenic plant can grow under stress conditions (e.g., high salt, decreased 

15 water, low temperatures, high temperatures) or under conditions that normally inhibit the 
growth of an untransformed Stresses include, for example, chilling stress, heat stress, 
heat shock, salt stress, water stress (i.e } drought), nutritional stress, disease, grazing 
pests, wound healing, pathogens such as for example fungi, bacteria, nematodes, viruses 
or parasitic weed and herbicides. Methodologies to determine plant growth or response 

20 to stress include for example, height measurements, weight or biomass measurements, 
leaf area or number, ability to flower, water use, transpiration rates and yield. 
Alternatively, the transformed plant has an increased (i.e. 9 enhanced) ABA sensitivity. 
The enhanced ABA sensitivity is at the seedling growth stage. Alternatively, the 
enhanced ABA sensitivity is at the mature plant stage. Additional altered phenotypes 

25 include for example, enhanced vegetative growth (e.g., increased leaf number, thickness 
and overall biomass), delayed reproductive growth (e.g., flowering later); enhanced 
seedling vigor (e.g.,increased root biomass and length), enhanced lateral root formation 
and therefore soil penetration more extensive vascular system resulting in an enhanced 
transport system. 

30 The plant can be any plant type including, for example, species from the genera 

Cucurbita, Rosa, Vitis, Juglans, Fragaria, Lotus, Medicago, Onobrychis, Trifolium, 
Trigonella, Vigna, Citrus, Linum, Geranium, Manihot, Daucus, Arabidopsis, Brassica, 
Raphanus, Sinapis, Atropa, Capsicum, Datura, Hyoscyamus, Ly coper sicon, Nicotiana, 
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Solarium, Petunia, Digitalis, Majorana, Ciahorium, Helianthus, Lactuca, Bromus, 
Asparagus, Antirrhinum, Heterocallis, Nemesis, Pelargonium, Panieum, Pennisetum, 
Ranunculus, Senecio, Salpiglossis, Cucumis, Browaalia, Glycine, Pisum, Phaseolus, 
Lolium, Oryza, Zea, Avena, Hordeum, Secale, Triticum, Sorghum, Gossypium, Picea, 
5 Caco, and Populus. 



Screening Methods 

The isolated nucleic acid molecules of the invention can be used to express CPP 
protein (e.g., via a recombinant expression vector in a host cell), to detect CPP mRNA 
(e.g., in a biological sample) or a genetic lesion in a CPP gene, and to modulate CPP 
10 activity, as described further, below. In addition, the CPP proteins can be used to screen 
compounds that modulate the CPP protein activity or expression. In addition, the 
anti-CPP antibodies of the invention can be used to detect and isolate CPP proteins and 
modulate CPP activity. 

The invention provides a method (also referred to herein as a "screening assay") 
15 for identifying modulators, i.e., candidate or test compounds or agents (e.g., peptides, 
peptidomimetics, small molecules or other drugs) that bind to CPP proteins or have a 
stimulatory or inhibitory effect on, e.g., CPP protein expression or CPP protein activity. 
The invention also includes compounds identified in the screening assays described 
herein. 

20 In one embodiment, the invention provides assays for screening candidate or test 

compounds which bind to a CPP protein or polypeptide or biologically-active portion 
thereof. The test compounds of the invention can be obtained using any of the numerous 
approaches in combinatorial library methods known in the art, including: biological 
libraries; spatially addressable parallel solid phase or solution phase libraries; synthetic 

25 library methods requiring deconvolution; the "one-bead one-compound" library method; 
and synthetic library methods using affinity chromatography selection. The biological 
library approach is limited to peptide libraries, while the other four approaches are 
applicable to peptide, non-peptide oligomer or small molecule libraries of compounds. 
See, e.g., Lam, 1997 '. Anticancer Drug Design 12: 145. 

30 A "small molecule" as used herein, is meant to refer to a composition that has a 

molecular weight of less than about 5 kD and most preferably less than about 4 kD. 
Small molecules can be, e.g., nucleic acids, peptides, polypeptides, peptidomimetics, 
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carbohydrates, lipids or other organic or inorganic molecules. Libraries of chemical 
and/or biological mixtures, such as fungal, bacterial, or algal extracts, are known in the 
art and can be screened with any of the assays of the invention. 

Examples of methods for the synthesis of molecular libraries can be found in the art, for 
5 example in: DeWitt, et al, 1993. Proc. Natl. Acad. Scl U.S.A. 90: 6909; Erb, et aL, 

1994. Proc. Natl Acad. Scl U.S.A. 91: 11422; Zuckermann, et al, 1 994. J. Med. Chem. 

37: 2678; Cho, et ah, 1993. Science 261: 1303; Carrell, et al, 1994. Angew. Chem. Int. 

Ed. Engl 33: 2059; Carell, et al, 1994. Angew. Chem. Int. Ed. Engl 33: 2061; and 

Gallop, etal, 1994. J. Med. Chem. 37: 1233. 
10 Libraries of compounds may be presented in solution (e.g., Houghten, 1992. 

Biotechniques 13: 412-421), or on beads (Lam, 1991. Nature 354: 82-84), on chips 

(Fodor, 1993. Nature 364: 555-556), bacteria (Ladner, U.S. Patent No. 5,223,409), 

spores (Ladner, U.S. Patent 5,233,409), plasmids (Cull, et al, 1992. Proc. Natl Acad. 

Scl USA 89: 1865-1869) or on phage (Scott and Smith, 1990. Science 249: 386-390; 
15 Devlin, 1990. Science 249: 404-406; Cwirla, et al, 1990. Proc. Natl Acad. Scl USA. 

87: 6378-6382; Felici, 1991. J. Mol Biol. 222: 301-310; Ladner, U.S. Patent No. 

5,233,409.). 

In one embodiment, an assay is a cell-based assay in which a cell which expresses a CPP 
protein, or a biologically-active portion thereof, is contacted with a test compound and 

20 the ability of the test compound to bind to a CPP protein determined. The cell, for 

example, can be of mammalian origin, plant cell or a yeast cell. Determining the ability 
of the test compound to bind to the CPP protein can be accomplished, for example, by 
coupling the test compound with a radioisotope or enzymatic label such that binding of 
the test compound to the CPP protein or biologically-active portion thereof can be 

25 determined by detecting the labeled compound in a complex. For example, test 

compounds can be labeled with 125 I, 35 S, 14 C, or 3 H, either directly or indirectly, and the 
radioisotope detected by direct counting of radioemission or by scintillation counting. 
Alternatively, test compounds can be enzymatically-labeled with, for example, 
horseradish peroxidase, alkaline phosphatase, or luciferase, and the enzymatic label 

30 detected by determination of conversion of an appropriate substrate to product. In one 
embodiment, the assay comprises contacting a cell which expresses a CPP protein, or a 
biologically-active portion thereof, with a known compound which binds CPP to form an 
assay mixture, contacting the assay mixture with a test compound, and determining the 
ability of the test compound to interact with a CPP protein, wherein determining the 
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ability of the test compound to interact with a CPP protein comprises determining the 
ability of the test compound to preferentially bind to CPP protein or a biologically-active 
portion thereof as compared to the known compound. 

In another embodiment, an assay is a cell-based assay comprising contacting a 
5 cell expressing a CPP protein, or a biologically-active portion thereof, with a test 

compound and determining the ability of the test compound to modulate (e.g., stimulate 
or inhibit) the activity of the CPP protein or biologically-active portion thereof. 
Determining the ability of the test compound to modulate the activity of CPP or a 
biologically-active portion thereof can be accomplished, for example, by determining the 

10 ability of the CPP protein to bind to or interact with a CPP target molecule. As used 
herein, a "target molecule" is a molecule with which a CPP protein binds or interacts in 
nature, for example, a molecule on the surface of a cell which expresses a CPP 
interacting protein, a molecule on the surface of a second cell, a molecule in the 
extracellular milieu, a molecule associated with the internal surface of a cell membrane 

15 or a cytoplasmic molecule. A CPP target molecule can be a non-CPP molecule or a CPP 
protein or polypeptide of the invention In one embodiment, a CPP target molecule is a 
component of a signal transduction pathway that facilitates transduction of an 
extracellular signal (e.g. a signal generated by binding of a compound to a 
membrane-bound molecule) through the cell membrane and into the cell. The target, for 

20 example, can be a second intercellular protein that has catalytic activity or a protein that 
facilitates the association of downstream signaling molecules with CPP. 
Determining the ability of the CPP protein to bind to or interact with a CPP target 
molecule can be accomplished by one of the methods described above for determining 
direct binding. In one embodiment, determining the ability of the CPP protein to bind to 

25 or interact with a CPP target molecule can be accomplished by determining the activity 
of the target molecule. For example, the activity of the target molecule can be 
determined by detecting induction of a cellular second messenger of the target (Le. 
intracellular Ca 2+ , diacylglycerol, IP3, etc.), detecting catalytic/enzymatic activity of the 
target an appropriate substrate, detecting the induction of a reporter gene (comprising a 

30 CPP-responsive regulatory element operatively linked to a nucleic acid encoding a 
detectable marker, e.g., luciferase), or detecting a cellular response, for example, cell 
survival, cellular differentiation, or cell proliferation. 

In yet another embodiment, an assay of the invention is a cell-free assay 
comprising contacting a CPP protein or biologically-active portion thereof with a test 
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compound and determining the ability of the test compound to bind to the CPP protein or 
biologically-active portion thereof. Binding of the test compound to the CPP protein can 
be determined either directly or indirectly as described above. In one such embodiment, 
the assay comprises contacting the CPP protein or biologically-active portion thereof 
5 with a known compound which binds CPP to form an assay mixture, contacting the assay 
mixture with a test compound, and determining the ability of the test compound to 
interact with a CPP protein, wherein determining the ability of the test compound to 
interact with a CPP protein comprises determining the ability of the test compound to 
preferentially bind to CPP or biologically-active portion thereof as compared to the 

1 0 known compound. 

In still another embodiment, an assay is a cell-free assay comprising contacting 
CPP protein or biologically-active portion thereof with a test compound and determining 
the ability of the test compound to modulate (e.g. stimulate or inhibit) the activity of the 
CPP protein or biologically-active portion thereof. Determining the ability of the test 

15 compound to modulate the activity of CPP can be accomplished, for example, by 

determining the ability of the CPP protein to bind to a CPP target molecule by one of the 
methods described above for determining direct binding. In an alternative embodiment, 
determining the ability of the test compound to modulate the activity of CPP protein can 
be accomplished by determining the ability of the CPP protein further modulate a CPP 

20 target molecule. For example, the catalytic/enzymatic activity of the target molecule on 
an appropriate substrate can be determined as described above. 

In yet another embodiment, the cell-free assay comprises contacting the CPP 
protein or biologically-active portion thereof with a known compound which binds CPP 
protein to form an assay mixture, contacting the assay mixture with a test compound, and 

25 determining the ability of the test compound to interact with a CPP protein, wherein 
determining the ability of the test compound to interact with a CPP protein comprises 
determining the ability of the CPP protein to preferentially bind to or modulate the 
activity of a CPP target molecule. 

The cell-free assays of the invention are amenable to use of both the soluble form 

30 or the membrane-bound form of CPP protein. In the case of cell-free assays comprising 
the membrane-bound form of CPP protein, it may be desirable to utilize a solubilizing 
agent such that the membrane-bound form of CPP protein is maintained in solution. 
Examples of such solubilizing agents include non-ionic detergents such as 
n-octylglucoside, n-dodecylglucoside, n-dodecylmaltoside, 
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octanoyl-N-methylglucamide, decanoyl-N-methylglucamide, Triton® X-100, Triton® 
X-l 14, Thesit®, Isotridecypoly(ethylene glycol ether) n , N-dodecyl-- 
N,N-dimethyl-3-ammonio-l -propane sulfonate, 3-(3-cholamidopropyl) 
dimethylamminiol-1 -propane sulfonate (CHAPS), or 
5 3-(3-cholamidopropyl)dimethylamminiol-2-hydroxy-l -propane sulfonate (CHAPSO). 

In more than one embodiment of the above assay methods of the invention, it 
may be desirable to immobilize either CPP protein or its target molecule to facilitate 
separation of complexed from uncomplexed forms of one or both of the proteins, as well 
as to accommodate automation of the assay. Binding of a test compound to CPP protein, 

10 or interaction of CPP protein with a target molecule in the presence and absence of a 
candidate compound, can be accomplished in any vessel suitable for containing the 
reactants. Examples of such vessels include microtiter plates, test tubes, and 
micro-centrifuge tubes. In one embodiment, a fusion protein can be provided that adds a 
domain that allows one or both of the proteins to be bound to a matrix. For example, 

15 GST-CPP fusion proteins or GST-target fusion proteins can be adsorbed onto glutathione 
sepharose beads (Sigma Chemical, St. Louis, MO) or glutathione derivatized microtiter 
plates, that are then combined with the test compound or the test compound and either 
the non-adsorbed target protein or CPP protein, and the mixture is incubated under 
conditions conducive to complex formation (e.g., at physiological conditions for salt and 

20 pH). Following incubation, the beads or microtiter plate wells are washed to remove any 
unbound components, the matrix immobilized in the case of beads, complex determined 
either directly or indirectly, for example, as described, supra. Alternatively, the 
complexes can be dissociated from the matrix, and the level of CPP protein binding or 
activity determined using standard techniques. 

25 Other techniques for immobilizing proteins on matrices can also be used in the 

screening assays of the invention. For example, either the CPP protein or its target 
molecule can be immobilized utilizing conjugation of biotin and streptavidin. 
Biotinylated CPP protein or target molecules can be prepared from biotin-NHS 
(N-hydroxy-succinimide) using techniques well-known within the art (e.g., biotinylation 

30 kit, Pierce Chemicals, Rockford, 111.), and immobilized in the wells of 

streptavidin-coated 96 well plates (Pierce Chemical). Alternatively, antibodies reactive 
with CPP protein or target molecules, but which do not interfere with binding of the CPP 
protein to its target molecule, can be derivatized to the wells of the plate, and unbound 
target or CPP protein trapped in the wells by antibody conjugation. Methods for 
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detecting such complexes, in addition to those described above for the GST-immobilized 
complexes, include immunodetection of complexes using antibodies reactive with the 
CPP protein or target molecule, as well as enzyme-linked assays that rely on detecting an 
enzymatic activity associated with the CPP protein or target molecule. 
5 In another embodiment, modulators of CPP protein expression are identified in a 

method wherein a cell is contacted with a candidate compound and the expression of 
CPP mRNA or protein in the cell is determined. The level of expression of CPP mRNA 
or protein in the presence of the candidate compound is compared to the level of 
expression of CPP mRNA or protein in the absence of the candidate compound. The 

10 candidate compound can then be identified as a modulator of CPP mRNA or protein 
expression based upon this comparison. For example, when expression of CPP mRNA 
or protein is greater (i.e., statistically significantly greater) in the presence of the 
candidate compound than in its absence, the candidate compound is identified as a 
stimulator of CPP mRNA or protein expression. Alternatively, when expression of CPP 

15 mRNA or protein is less (statistically significantly less) in the presence of the candidate 
compound than in its absence, the candidate compound is identified as an inhibitor of 
CPP mRNA or protein expression. The level of CPP mRNA or protein expression in the 
cells can be determined by methods described herein for detecting CPP mRNA or 
protein. 

20 In yet another aspect of the invention, the CPP proteins can be used as "bait 

proteins" in a two-hybrid assay or three hybrid assay (see, e.g., U.S. Patent No. 
5,283,317; Zervos, et a/., 1993. Cell 72: 223-232; Madura, et aL, 1993. J. Biol Chem. 
268: 12046-12054; Bartel, et a/., 1993. Biotechniques 14: 920-924; Iwabuchi, et a/., 
1993. Oncogene 8: 1693-1696; and Brent WO 94/10300), to identify other proteins that 

25 bind to or interact with CPP ("CPP-binding proteins" or "CPP-bp") and modulate CPP 
activity. Such CPP-binding proteins are also likely to be involved in the propagation of 
signals by the CPP proteins as, for example, upstream or downstream elements of the 
CPP pathway. 

The two-hybrid system is based on the modular nature of most transcription 
30 factors, which consist of separable DNA-binding and activation domains. Briefly, the 
assay utilizes two different DNA constructs. In one construct, the gene that codes for 
CPP is fused to a gene encoding the DNA binding domain of a known transcription 
factor (e.g., GAL-4). In the other construct, a DNA sequence, from a library of DNA 
sequences, that encodes an unidentified protein ("prey" or "sample") is fused to a gene 
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that codes for the activation domain of the known transcription factor. If the "bait" and 
the "prey" proteins are able to interact, in vivo, forming a CPP-dependent complex, the 
DNA-binding and activation domains of the transcription factor are brought into close 
proximity. This proximity allows transcription of a reporter gene (e.g., LacZ) that is 
5 operably linked to a transcriptional regulatory site responsive to the transcription factor. 
Expression of the reporter gene can be detected and cell colonies containing the 
functional transcription factor can be isolated and used to obtain the cloned gene that 
encodes the protein which interacts with CPP. 

In yet another aspect of the invention are methods which utilize the transgenic 
10 plants of the invention to identify CPP-interacting components via genetic screening 
protocols. These components can be for example, regulatory elements which modify 
CPP-gene expression, interacting proteins which directly modify CPP activity or 
interacting proteins which modify components of the same signal transduction pathway 
and thereby exert an effect on the expression or activity of CPP. Briefly, genetic 
15 screening protocols are applied to the transgenic plants of the invention and in so doing 
identify related genes which are not identified using a wild type background for the 
screen. For example an activation tagged library (Weigel, et al. 9 2000. Plant PhysioL 
122: 1003-1013), can be produced using the transgenic plants of the invention as the 
genetic background. Plants are then screened for altered phenotypes from that displayed 
20 by the parent plants. Alternative methods of generating libraries from the transgenic 
plants of the invention can be used, for example, chemical or irradiation induced 
mutations, insertional inactivation or insertional activation methods. 

The invention further pertains to novel agents identified by the aforementioned 
screening assays and uses thereof. 

25 

Examples 

Example 1: RT-PCR amplification and cloning of CaaX prenyl proteases 

Total RNA was isolated from leaf tissue of Arabidopsis thaliana, Brassica napus 
and Glycine max, using the Qiagen RNeasy kit and used as template to amplify the CPP 
30 genes by RT-PCR. Reaction conditions were as follows; IX reaction buffer (lOmM Tris- 
HC1 pH 8.8, 1.5mM MgCl 2 , 50mM KC1), dNTP's at 200^iM, lpM AtCPP BamFW and 
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AtCPP SmaRV primers, 2.5U. Pfu DNA polymerase, and template plus water to a final 
volume of IOOjuL. Reactions were run at 1 minute 94°C, 1 minute 60°C, 1 minute 72°C, 
for 30 cycles. Primers used to PCR amplify Arabidopsis and Brassica sequences were 
those identified by SEQ ID NO:6 and SEQ ID NO:7. Primers used to PCR amplify the 
5 Glycine sequence were those identified by SEQ ID NO:54 and SEQ ID NO:55. PCR 
products were separated from the RT-PCR reaction mixture using the Qiagen PCR 
column spin kit and ligated into the prepared cloning vector, pBluescript KS+. The 
vector had been prepared by digestion with EcoKV and treated with Taq polymerase in 
the presence of dTTP to produce a 3' overhand suitable for ligation with the PCR 
10 products. The ligation products were transformed into E. coli DH5oc cells, positive 
colonies selected and the resulting inserts sequenced. The above methodology is 
applicable to obtain homologous sequences and may require alternative primers. 



Table 1. 

15 AtCPP BamFW: 5 f -AAAGGATCCATGGCGATTCCTTTCATGG-3' 

(SEQ ID NO:6) 

AtCPP SmaRV: 5'-AAACCCGGGTTAATCTGTCTTCTTGTCTTCTCCA-3' 
(SEQ ID NO:7) 

GmCPP SmaFW: S'-AAACCCGGGATGGCGTTTCCCTACATGGAAGCC - 
20 3' (SEQ ID NO:54) 

GmCPP SacRV: 5-AAAGAGCTCTTAGTCTTCCTTCTTATCCGGTTCG -3' 
(SEQ ID NO:55) 

Example 2: Vector Construction 

Construction of the pBI 121 -AtCPP construct (SEQ ID NO: 4) was prepared as 
25 follows. The pBI121 vector was digested with BamRl and Smal. The AtCPP, 1.4 kb 

DNA fragment from RT-PCR (SEQ ID NO: 1) was digested with BamHl and Smal and 
ligated into the pBI121 vector. The GUS sequence was then removed by digestion with 
Smal and EcolCRl and the vector ligated after purification of the vector from the GUS 
insert to produce the pBI121-AtCPP vector (Figure 1 A). This construct was used to 
30 further generate constructs expressing the CPP gene from Brassica and Glycine. To 

produce the pBI121-BnCPP construct (SEQ ID NO:47) primer pairs identified by SEQ 
ID NO:6 and SEQ ID NO:7 are used to PCR amplify the appropriate fragment which is 
ligated into the prepared parent vector. To produce the pBI121 -GmCPP construct (SEQ 
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ID NO:41) primer pairs identified by SEQ ID NO:54 and SEQ ID NO:55 are used to 
PCR amplify the appropriate fragment which is ligated into the prepared parent vector. 

Construction of the pBI121-antisense- AtCPP construct (SEQ ID NO:35). The 
antisense fragment was produced using PCR amplification with SEQ ID NO:l as 
5 template and primers identified as SEQ ID NO: 1 1 and SEQ ID NO: 1 2, listed in Table 2. 
This fragment was digested with BamHl and Smal and used to replace the sense 
fragment of the pBI121-AtCPP construct (SEQ ID NO: 4), to yield SEQ ID NO:35 
(Figure IB) . This construct, SEQ ID NO:35, was used to further generate constructs 
expressing the antisense CPP gene from Brassica and Glycine. To produce the pBI121- 
10 antisense-BnCPP construct (SEQ ID NO:49) primer pairs identified by SEQ ID NO:56 
and SEQ ID NO:57 are used to PCR amplify the appropriate fragment which is ligated 
into the prepared parent vector. To produce the pBI121-antisense-GmCPP construct 
(SEQ ID NO:43) primer pairs identified by SEQ ID NO:58 and SEQ ID NO:59 are used 
to PCR amplify the appropriate fragment which is ligated into the prepared parent vector. 

15 Construction of the pBI121-HP-AtCPP construct (SEQ ID NO: 5). The cloning 

strategy involved truncating the GUS gene of pBI121 and flanking the GUS sequence 
with a AtCPP fragment in the antisense orientation upstream of the GUS and in the sense 
orientation on the downstream side of GUS. The pBI121 vector was digested with Smal 
and Sacl, the GUS sequence and the vector fragments were purified from one another. 

20 The isolated GUS fragment was digested using EcoRV and the 1 079 bp. blunt ended 
EcoKV/Sacl fragment isolated. This was ligated back into the digested parent vector at 
the SmaVSacl sites. This intermediate vector was used in the subsequent production of 
the hair-pin vectors. The AtCPP fragment to be used as the gene specific hair-pin 
sequence was isolated by PCR. Primers identified as SEQ ID NO:8 and SEQ ID NO:9, 

25 listed in Table 2, were used to generate a 596 bp fragment. Cloning of the sense 

orientation fragment was achieved by digesting the PCR AtCPP fragment with Sacl and 
ligation into the Sacl site at the 3' end of GUS. To insert the same fragment upsteam of 
GUS, the BamHl site was opened and the ends blunted with Klenow. The PCR amplified 
AtCPP fragment was digested with EcolCRI, which is an isoschizomer of Sacl but leaves 

30 blunt ends, and ligated into the blunted BamHl site of the vector to yield the final 

construct (Figure 1C). The intermediate construct used to produce SEQ ID NO:5 above 
contained only the truncated GUS gene and no CPP sequences this intermediate vector 
was used to further generate constructs expressing hair-pin CPP gene constructs from 
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Brassica and Glycine. To produce the pBI121-HP-BnCPP construct (SEQ ID NO:48) 
primer pairs identified by SEQ ID NO:58 and SEQ ID NO:59 are used to PCR amplify 
the sense fragment and primer pairs identified by SEQ ID NO:60 and SEQ ID NO:61 are 
used to PCR amplify the antisense fragment. These fragments are cloned into the 
5 prepared intermediate vector described above. To produce the pBI121-HP-GmCPP 

construct (SEQ ID NO:42) primer pairs identified by SEQ ID NO:62 and SEQ ID NO:63 
are used to PCR amplify the sense fragment and primer pairs identified by SEQ ID 
NO:64 and SEQ ID NO:65 are used to PCR amplify the antisense fragment. These 
fragments are cloned into the prepared intermediate vector described above. 

10 The above vector constructs were modified to place the genes under the control of 
alternative promoters, such as, but not limited to, the RD29A or MuA . This was 
accomplished by excising the 35S promoter sequence and replacing it with an 
appropriate promoter sequence. In this way SEQ ID NO's:39 and 40 were generated and 
SEQ ID NO's:38, 41-53 can be constructed. 

15 Table 2 

AtCPP-HP-SacFW 5'-CTGGAGCTCTTTTACCGAGGTTGGGCCTTGATCC-3* 
(SEQ ID NO:8) 

AtCPP-HP-SacRV 5-ATTGAGCTCCCAATGTCCAAGCTCGTGTGCAATA- 
3' (SEQ ID NO:9) 

20 AtCPP-anti-SmaFW 5'-AAACCCGGGATGGCGATTCCTTTCATGG-3' 

(SEQ ID NO: 11) 

AtCPP-anti-BamRV 5 '- AAAGGATCCTTAATCTGTCTTCTTGTCTTCTCC A- 
3' 

(SEQ ID NO: 12) 

25 BnCPP-anti-SmaFW 5'-AAACCCGGGATGGCGATTCCTTTCATGG -3' 

(SEQ ID NO:56) 

BnCPP-anti-BamRV 5'-AAAGGATCCTTAATCTGTCTTCTTGTCTTCTCC - 
3' 

(SEQ ID NO:57) 

30 BnCPP-HP-Sac-FW 5'- AAAGAGCTCTTCTACCAATGGTGGGACTCG -3' 

(SEQ ID NO:58) 

BnCPP-HP-Sac-RV 5'- AAAGAGCTCCCAGTGTCCCAGCTCGTGTG -3' 
(SEQ ID NO:59) 

BnCPP-HP-BamFW 5*- AAAGGATCCTTCTACCAATGGTGGGACTCG -3' 
35 (SEQIDNO:60) 

BnCPP-HP-XbaRV 5'- AAATCTAGACCAGTGTCCCAGCTCGTGTG -3' 
(SEQ IDNO:61) 

GmCPP-HP-Sac-FW 

5'-GATGAGCTCACAAGATCAAGTCACAGCAATGCCT -3' 
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(SEQ ID NO:62) 

GmCPP-HP-Sac-RV 5'- AAAGAGCTCCCGGTTCGTCCAGCGCGGCC -3' 
(SEQ ID NO:63) 

GmCPP-HP-BamFW 

5'- GATGGATCCACAAGATCAAGTCACAGCAATGCCT -3' 
(SEQ ID NO:64) 

GmCPP-HP-XbaRV 5'- CCTTCTAGACCGGTTCGTCCAGCGCGGCC -3* 
(SEQ ID NO:65) 



10 Example 3: Sequence Analysis 

Arabidopsis thaliana CPP (AtCPP) 

A disclosed nucleic acid of 1275 nucleotides (SEQ ID NO:l) and also referred to 
as AtCPP, is shown in Table 3. 

Table 3 A, AtCPP Nucleotide Sequence (SEQ ID NO:l). 

ATGGCGATTCCTTTCATGGAAACCGTCGTGGGTTTTATGATAGTGATGTACATTTTTGAG 
ACGTATTTGGATCTGAGGCAACTCACTGCTCTCAAGCTTCCAACTCTCCCGAAAACCTTG 
G T T GG T G T AAT TAG C C AAG AG AAG T T T G AG AAAT C AC G AGC AT AC AG T C T T G AC AAAAG C 
TATTTTCACTTTGTTCATGAGTTTGTAACTATACTTATGGACTCTGCAATTTTGTTCTTT 
GGGATCTTGCCTTGGTTTTGGAAGATGTCTGGAGCTGTTTTACCGAGGTTGGGCCTTGAT 
CCGGAGAATGAAATACTGCATACTCTTTCATTCTTGGCTGGTGTTATGACATGGTCACAG 
ATCACTGATTTGCCATTTTCTTTGTACTCAACTTTCGTGATCGAGTCTCGGCATGGGTTC 
AAC AAAC AAAC AAT AT GG AT G T T CAT T AG GG AC AT GAT C AAAGGAAC AT TCCTCTCTGTC 
ATACTAGGCCCACCCATTGTTGCTGCGATAATTTTCATAGTCCAGAAAGGAGGTCCTTAT 
CTTGCCATCTATCTGTGGGCATTCATGTTTATCCTGTCTCTAGTGATGATGACTATATAC 
CCGGTCTTGATAGCACCGCTCTTCAACAAATTCACTCCTCTTCCAGATGGAGACCTCCGG 
GAGAAGATTGAGAAACTTGCTTCTTCCCTAAAGTTTCCTTTGAAGAAGCTGTTTGTTGTC 
GATGGATCTACAAGGTCAAGCCATAGCAATGCTTACATGTATGGTTTCTTTAAGAACAAA 
AGGATTGTTCTTTATGATACGTTGATTCAGCAGTGCAAGAATGAGGATGAAATTGTGGCG 
G T T A T T GC AC AC GAG C T T G G AC AT T GGAAAC T G AAT C AC AC T AC AT AC T C G T T CAT T G C A 
GTTCAAATCCTTGCCTTCTTACAATTTGGAGGATACACTCTTCTCAGAAACTCCACTGAT 
CTCTTCAGGAGTTTCGGATTTGATACACAGCCTGTTCTCATTGGTTTGATCATATTTCAG 
CACACTGTAATACCACTGCAACATCTAGTAAGCTTTGGCCTGAACCTCGTTAGTCGAGCG 
TTTGAGTTTCAGGCTGATGCTTTTGCTGTGAAGCTTGACTATGCAAAAGATCTTCGTCCT 
GCT C TAG T G AAAC T AC AGGAAG AGAAC T TAT C AAC AAT G AAC AC T GAT C CAT T G T AC T C A 
GCTTATCACTACTCACATCCTCCTCTTGTTGAAAGGCTTCGAGCCACTGATGGAGAAGAC 
AAGAAGACAGAT TAA 
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A disclosed CPP polypeptide (SEQ ID NO:2) encoded by SEQ ID NO:l has 424 amino 
acid residues and is presented in Table 3B using the one-letter amino acid code. 



Table 3B. Encoded CPP protein sequence (SEQ ID NO:2). 



MAIPFMETVVGFMIVMYIFETYLDLRQLTALKLPTLPKTLVGVISQEKFEKSRAYSLDKS 
YFHFVHEFVTILMDSAILFFGILPWFWKMSGAVLPRLGLDPENEILHTLSFLAGVMTWSQ 
ITDLPFSLYSTFVIESRHGFNKQTIWMFIRDMIKGTFLSVILGPPIVAAIIFIVQKGGPY 
LAI YLWAFMFILSLVMMTIYPVLIAPLFNKFTPLPDGDLREKIEKLASSLKFPLKKLFVV 
DGSTRSSHSNAYMYGFFKNKRIVLYDTLIQQCKNEDEIVAVIAHELGHWKLNHTTYSFIA 
VQILAFLQFGGYTLLRNSTDLFRSFGFDTQPVLIGLIIFQHTVI PLQHLVSFGLNLVSRA 
FEFQADAFAVKLDYAKDLRPALVKLQEENLSTMNTDPLYSAYHYSHPPLVERLRATDGED 
KKTD 



The present invention also includes a nucleic acid sequence complimentary to the 
Arabidopsis thaliana CaaX prenyl protease of SEQ ID NO: 1 . The disclosed 
complimentary sequence is shown as SEQ ID NO:20. 



SEQ ID NO:20 

TTAATCTGTCTTCTTGTCTTCTCCATCAGTGGCTCGAAGCCTTTCAACAAGAGGAGGAT 

10 GTGAGTAGTGATAAGCTGAGTACAATGGATCAGTGTTCATTGTTGATAAGTTCTCTTCC 
TGTAGTTTCACTAGAGCAGGACGAAGATCTTTTGCATAGTCAAGCTTCACAGCAAAAGC 
ATCAGCCTGAAACTCAAACGCTCGACTAACGAGGTTCAGGCCAAAGCTTACTAGATGTT 
GCAGTGGTATTACAGTGTGCTGAAATATGATCAAACCAATGAGAACAGGCTGTGTATCA 
AATCCGAAACTCCTGAAGAGATCAGTGGAGTTTCTGAGAAGAGTGTATCCTCCAAATTG 

15 TAAGAAGGCAAGGATTTGAACTGCAATGAACGAGTATGTAGTGTGATTCAGTTTCCAAT 
GTCCAAGCTCGTGTGCAATAACCGCCACAATTTCATCCTCATTCTTGCACTGCTGAATC 
AACGTATCATAAAGAACAATCCTTTTGTTCTTAAAGAAACCATACATGTAAGCATTGCT 
ATGGCTTGACCTTGTAGATCCATCGACAACAAACAGCTTCTTCAAAGGAAACTTTAGGG 
AAGAAGCAAGTTTCTCAATCTTCTCCCGGAGGTCTCCATCTGGAAGAGGAGTGAATTTG 

20 TTGAAGAGCGGTGCTATCAAGACCGGGTATATAGTCATCATCACTAGAGACAGGATAAA 
CATGAATGCCCACAGATAGATGGCAAGATAAGGACCTCCTTTCTGGACTATGAAAATTA 
TCGCAGCAACAATGGGTGGGCCTAGTATGACAGAGAGGAATGTTCCTTTGATCATGTCC 
CTAATGAACATCCATATTGTTTGTTTGTTGAACCCATGCCGAGACTCGATCACGAAAGT 
TGAGTACAAAGAAAATGGCAAATCAGTGATCTGTGACCATGTCATAACACCAGCCAAGA 

25 ATGAAAGAGTATGCAGTATTTCATTCTCCGGATCAAGGCCCAACCTCGGTAAAACAGCT 
CCAGACATCTTCCAAAACCAAGGCAAGATCCCAAAGAACAAAATTGCAGAGTCCATAAG 
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TATAGTTACAAACTCATGAACAAAGTGAAAATAGCTTTTGTCAAGACTGTATGCTCGTG 
ATTTCTCAAACTTCTCTTGGCTAATTACACCAACCAAGGTTTTCGGGAGAGTTGGAAGC 
TTGAGAGCAGTGAGTTGCCTCAGATCCAAATACGTCTCAAAAATGTACATCACTATCAT 
AAAACCCACGACGGTTTCCATGAAAGGAATCGCCAT 

5 

Due to the nature of the cloning strategy the sequence presented is not full length 
but is missing the 5 f and 3' non-translated regions. The percent identities of the 
Arabidopsis thaliana nucleotide sequence and its encoded amino acid sequence to that of 
other CPP sequences as determined by ClustalW analysis are shown in Figure 2. 
10 Using the sequences disclosed herein as hybridization probes, one is able to screen and 
isolate full length sequences from cDNA or genomic libraries or use the rapid 
amplification of cDNA ends (RACE) technology or other such PCR techniques. 

Brassica napus CPP (BnCPP) 

15 A disclosed nucleic acid of 1275 nucleotides (SEQ ID NO: 14) and also referred 

to as BnCPP, is shown in Table 4. 



Table 4A. BnCPP Nucleotide Sequence (SEQ ID NO: 14). 

ATGGCGATTCCTTTCATGGAAACCGTCGTTGGTTTTATGATAGTGATGTACGTTTTTGAGACGTA 
TTTGGATCTGAGGCAACATACTGCTCTCAAGCTTCCCACTCTCCCAAAGACTTTGGTTGGAGTCA 
TTAGCCAAGAGAAGTTTGAGAAATCTCGAGCTTACAGTCTTGACAAAAGCCATTTTCACTTTGTT 
CATGAGTTTGTTACTATACTTATGGACTCTGCGATTCTGTTCTTTGGGATCTTGCCTTGGTTTTG 
GAAGATATCTGGCGGCTTTCTACCAATGGTGGGACTCGATCCAGAGAATGAAATCCTGCACACTC 
TTTCATTCTTGGCTGGTCTTATGACATGGTCACAGATCACTGATTTGCCATTTTCTTTGTACTCA 
ACTTTCGTGATCGAGTCTCGGCATGGGTTCAACAAACAAACAATATGGATGTTCATTAGGGACAT 
GATCAAAGGAATACTCCTCTCTGTCATACCTGCCCCTCCTATCGTTGCCGCAATTATTGTTATAG 
TTCAGAAAGGAGGTCCTTACCTCGCCATCTATCTGTGGGCATTCATGTTTATCCTGTCTCTAGTG 
ATGATGACTATATACCCTGTTTTGATTGCACCTCTTTTCAACAAGTTCACTCCTCTTCCTGATGG 
AGACCTCCGGGAGAAGATTGAGAAACTTGCTTCTTCTCTAAAGTTTCCTCTGAAGAAGCTGTTTG 
TTGTCGATGGATCTACAAGGTCAAGCCATAGTAATGCTTACATGTATGGTTTCTTCAAGAACAAA 
AGGATTGTTCTTTATGACACATTGATTCAGCAGTGCCAGAATGAGAATGAAATTGTGGCGGTTAT 
TGCACACGAGCTGGGACACTGGAAGCTGAATCACACTACATACTCGTTCATTGCTGTTCAAATCC 
TTGCCTTCTTGCAATTTGGAGGATACACTCTTGTCAGAAACTCCACTGATCTCTTCAGGAGTTTT 
GGTTTTGATACACAACCAGTTCTCATTGGTTTGATCATATTTCAGCACACTGTAATACCACTTCA 
ACACCTAGTAAGCTTTGACCTCAACCTTGTTAGTCGAGCGTTTGAGTTTCAGGCTGATGCTTTTG 
CAGTGAATCTTGGTTATGCAAAGGATCTACGTCCTGCCCTAGTGAAGCTACAGGAAGAGAACTTA 
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TCAGCGATGAACACAGACCCATTGTACTCAGCTTATCACTACTCACACCCTCCTCTTGTAGAGAG 
G C T T C GAG C CAT T GAT GG AG AAG AC AAG AAG AC AG AT T AA 



A disclosed CPP polypeptide (SEQ ID NO: 15) encoded by SEQ ID NO: 14 has 
424 amino acid residues and is presented in Table 4B using the one-letter amino acid 
code. 

Table 4B. Encoded CPP protein sequence (SEQ ID NO:15). 

MAIPFMETVVGFMIVMYVFETYLDLRQHTALKLPTLPKTLVGVISQEKFEKSRAYSLDKSHFHF 
VHEFVTILMDSAILFFGILPWFWKISGGFLPMVGLDPENEILHTLSFLAGLMTWSQITDLPFSL 
YSTFVIESRHGFNKQTIWMFIRDMIKGILLSVIPAPPIVAAIIVIVQKGGPYLAIYLWAFMFIL 
SLVMMTIYPVLIAPLFNKFTPLPDGDLREKIEKLASSLKFPLKKLFVVDGSTRSSHSNAYMYGF 
FKNKRIVLYDTLIQQCQNENEIVAVIAHELGHWKLNHTTYSFIAVQILAFLQFGGYTLVRNSTD 
LFRSFGFDTQPVLIGLIIFQHTVI PLQHLVSFDLNLVSRAFEFQADAFAVNLGYAKDLRPALVK 
LQEENLSAMNTDPLYSAYHYSHPPLVERLRAIDGEDKKTD 

5 

The present invention also includes a nucleic acid sequence complimentary to the 
Brassica napus CaaX prenyl protease of SEQ ID NO: 14. The disclosed complimentary 
sequence is shown as SEQ ID NO: 16. 



10 SEQ ID NO: 16 

TTAATCTGTCTTCTTGTCTTCTCCATCAATGGCTCGAAGCCTCTCTACAAGAGGAGGGT 
GTGAGTAGTGATAAGCTGAGTACAATGGGTCTGTGTTCATCGCTGATAAGTTCTCTTCC 
TGTAGCTTCACTAGGGCAGGACGTAGATCCTTTGCATAACCAAGATTCACTGCAAAAGC 
ATCAGCCTGAAACTCAAACGCTCGACTAACAAGGTTGAGGTCAAAGCTTACTAGGTGTT 

15 GAAGTGGTATTACAGTGTGCTGAAATATGATCAAACCAATGAGAACTGGTTGTGTATCA 
AAACCAAAACTCCTGAAGAGATCAGTGGAGTTTCTGACAAGAGTGTATCCTCCAAATTG 
CAAGAAGGCAAGGATTTGAACAGCAATGAACGAGTATGTAGTGTGATTCAGCTTCCAGT 
GTCCCAGCTCGTGTGCAATAACCGCCACAATTTCATTCTCATTCTGGCACTGCTGAATC 
AATGTGTCATAAAGAACAATCCTTTTGTTCTTGAAGAAACCATACATGTAAGCATTACT 

20 ATGGCTTGACCTTGTAGATCCATCGACAACAAACAGCTTCTTCAGAGGAAACTTTAGAG 
AAGAAGCAAGTTTCTCAATCTTCTCCCGGAGGTCTCCATCAGGAAGAGGAGTGAACTTG 
TTGAAAAGAGGTGCAATCAAAACAGGGTATATAGTCATCATCACTAGAGACAGGATAAA 
CATGAATGCCCACAGATAGATGGCGAGGTAAGGACCTCCTTTCTGAACTATAACAATAA 
TTGCGGC7^ACGATAGGAGGGGCAGGTATGACAGAGAGGAGTATTCCTTTGATCATGTCC 

25 CTAATGAACATCCATATTGTTTGTTTGTTGAACCCATGCCGAGACTCGATCACGAAAGT 
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TGAGTACAAAGAAAATGGCAAATCAGTGATCTGTGACCATGTCATAAGACCAGCCAAGA 
ATGAAAGAGTGTGCAGGATTTCATTCTCTGGATCGAGTCCCACCATTGGTAGAAAGCCG 
CCAGATATCTTCCAAAACCAAGGCAAGATCCCAAAGAACAGAATCGCAGAGTCCATAAG 
TATAGTAACAAACTCATGAACAAAGTG7VAAATGGCTTTTGTCAAGACTGTAAGCTCGAG 
5 ATTTCTCAAACTTCTCTTGGCTAATGACTCCAACCAAAGTCTTTGGGAGAGTGGGAAGC 
TTGAGAGCAGTATGTTGCCTCAGATCCAAATACGTCTCAAAAACGTACATCACTATCAT 
AAAACCAACGACGGTTTCCATGAAAGGAATCGCCAT 



Due to the nature of the cloning strategy the sequence presented is not full length 
10 but is missing the 5 ! and 3' non-translated regions. The percent identities of the Brassica 
napus nucleotide sequence and its encoded amino acid sequence to that of other CPP 
sequences as determined by ClustalW analysis are shown in Figure 2. 
Using the sequences disclosed herein as hybridization probes, one is able to screen and 
isolate full length sequences from cDNA or genomic libraries or use the rapid 
15 amplification of cDNA ends (RACE) technology or other such PCR techniques. 



Glycine max CPP (GmCPP) 

A disclosed nucleic acid of 1275 nucleotides (SEQ ID NO: 17) and also referred 
to as GmCPP, is shown in Table 5. 

20 

Table 5A. GmCPP Nucleotide Sequence (SEQ ID NO:17). 

ATGGCGTTTCCCTACATGGAAGCCGTTGTCGGATTTATGATATTAATGTACATTTTTGAAACTTA 
CTTGGATGTGCGACAACATAGGGCCCTCAAACTTCCTACTCTTCCAAAGACTTTAGAGGGTGTTA 
TCAGCCAAGAGAAATTTGAGAAATCTAGAGCCTATAGTCTTGATAAAAGCCACTTCCATTTTGTT 
CACGAGTTTGTGACAATAGTGACAGACTCTACAATTTTGTACTTTGGGGTATTGCCCTGGTTTTG 
G AAGAAAT C AGG AG AT T T TAT G AC AAT AGC T G G T T T C AAT GC T G AG AAT G AAAT AC T G CAT AC C C 
TTGCCTTCTTAGCAGGGCTGATGATTTGGTCACAGATAACAGATTTGCCCTTTTCTCTGTACTCA 
ACTTTTGTGATTGAGGCCCGTCATGGTTTTAATAAGCAAACACCATGGTTATTCTTTAGGGACAT 
GCTTAAAGGAATTTTCCTTTCTGTAATAATTGGTCCACCTATTGTGGCTGCAATCATTGTAATAG 
TACAGAAAGGAGGTCCATACTTGGCCATCTATCTTTGGGTTTTTACGTTTGGTCTTTCTATTGTG 
ATGATGACCCTTTATCCAGTACTAATAGCTCCACTCTTCAATAAGTTCACTCCACTTCCAGATGG 
TCAACTCAGGGAGAAAATCGAGAAACTTGCTTCCTCCCTCAACTATCCGTTAAAGAAACTATTTG 
TTGTCGATGGATCCACAAGATCAAGTCACAGCAATGCCTATATGTATGGATTCTTCAAGAACAAG 
AGGATTGTCCCTTATGACACATTAATTCAACAGTGCAAAGACGATGAGGAAATTGTTGCTGTTAT 
T GC C CAT GAG T T G GG AC AC T GG AAGC T C AAC CAT AC T G T G T AC AC AT T T G T T GC T AT GC AG AT TC 
TTACACTTCTACAATTTGGAGGATATACACTAGTGCGAAATTCAGCTGATCTGTATCGAAGCTTT 
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GGGTTTGATACGCAGCCAGTCCTCATTGGGCTCATCATATTTCAGCATACTGTT^ATCCCACTTCA 
GCAATTGGTCAGCTTTGGTCTGAACCTAGTCAGCCGATCATTTGAATTTCAGGCTGATGGCTTTG 
CCAAGAAGCTTGGATATGCATCTGGATTACGCGGTGGTCTTGTGAAACTACAGGAGGAGAATCTG 
TCAGCTATGAATACAGATCCTTGGTACTCTGCTTATCACTATTCTCATCCTCCCCTTGTTGAAAG 
ATTGGCCGCGCTGGACGAACCGGATAAGAAGGAAGACTAA 



A disclosed CPP polypeptide (SEQ ID NO:18) encoded by SEQ ID NO:17 has 
424 amino acid residues and is presented in Table 5B using the one-letter amino acid 
code. 



Table 5B. Encoded CPP protein sequence (SEQ ID NO: 18). 



MAFPYMEAVVGFMILMYIFETYLDVRQHRALKLPTLPKTLEGVISQEKFEKSRAYSLDKS 
HFHFVHEFVTIVTDSTILYFGVLPWFWKKSGDFMTIAGFNAENEILHTLAFLAGLMIWSQ 
ITDLPFSLYSTFVIEARHGFNKQTPWLFFRDMLKGIFLSVIIGPPIVAAIIVIVQKGGPY 
LAIYLWVFTFGLSIVMMTLYPVLIAPLFNKFTPLPDGQLREKIEKLASSLNYPLKKLFVV 
DGSTRSSHSNAYMYGFFKNKRIVPYDTLIQQCKDDEEIVAVIAHELGHWKLNHTVYTFVA 
MQILTLLQFGGYTLVRNSADLYRSFGFDTQPVLIGLI I FQHTVI PLQQLVSFGLNLVSRS 
FEFQADGFAKKLGYASGLRGGLVKLQEENLSAMNTDPWYSAYHYSHPPLVERLAALDEPD 
KKED 



The present invention also includes a nucleic acid sequence complimentary to the 
Glycine max CaaX prenyl protease of SEQ ID NO: 17. The disclosed complimentary 
sequence is shown as SEQ ID NO: 19. 



10 SEQ ID NO: 19 

TTAGTCTTCCTTCTTATCCGGTTCGTCCAGCGCGGCCAATCTTTCAACAAGGGGAGGAT 
GAGAATAGTGATAAGCAGAGTACCAAGGATCTGTATTCATAGCTGACAGATTCTCCTCC 
TGTAGTTTCACAAGACCACCGCGTAATCCAGATGCATATCCAAGCTTCTTGGCAAAGCC 
ATCAGCCTGAAATTCAAATGATCGGCTGACTAGGTTCAGACCAAAGCTGACCAATTGCT 

15 GAAGTGGGATTACAGTATGCTGAAATATGATGAGCCCAATGAGGACTGGCTGCGTATCA 
AACCCAAAGCTTCGATACAGATCAGCTGAATTTCGCACTAGTGTATATCCTCCAAATTG 
TAGAAGTGTAAGAATCTGCATAGCAACAAATGTGTACACAGTATGGTTGAGCTTCCAGT 
GTCCCAACTCATGGGCAATT^ACAGCAACAATTTCCTCATCGTCTTTGCACTGTTGAATT 
AATGTGTCATAAGGGACAATCCTCTTGTTCTTGAAGAATCCATACATATAGGCATTGCT 

20 GTGACTTGATCTTGTGGATCCATCGACAACAAATAGTTTCTTTAACGGATAGTTGAGGG 
AGGAAGCAAGTTTCTCGATTTTCTCCCTGAGTTGACCATCTGGAAGTGGAGTGAACTTA 
TTGAAGAGTGGAGCTATTAGTACTGGATAAAGGGTCATCATCACAATAG7\AAGACCAAA 
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CGTAAAAACCCAAAGATAGATGGCCAAGTATGGACCTCCTTTCTGTACTATTACAATGA 
TTGCAGCCACAATAGGTGGACCAATTATTACAGAAAGGAA7U\TTCCTTTAAGCATGTCC 
CTAAAGAATAACCATGGTGTTTGCTTATTAAAACCATGACGGGCCTCAATCACAAAAGT 
TGAGTACAGAGAAAAGGGCAAATCTGTTATCTGTGACCAAATCATCAGCCCTGCTAAGA 
5 AGGCAAGGGTATGCAGTATTTCATTCTCAGCATTGAAACCAGCTATTGTCATAAAATCT 
CCTGATTTCTTCCAAAACCAGGGCAATACCCCAAAGTACAAAATTGTAGAGTCTGTCAC 
TATTGTCACAAACTCGTGAACAAAATGGAAGTGGCTTTTATCAAGACTATAGGCTCTAG 
ATTTCTCAAATTTCTCTTGGCTGATAACACCCTCTAAAGTCTTTGGAAGAGTAGGAAGT 
TTGAGGGCCCTATGTTGTCGCACATCCAAGTAAGTTTCAAAAATGTACATTAATATCAT 
10 AAATCCGACAACGGCTTCCATGTAGGGAAACGCCAT 



Due to the nature of the cloning strategy the sequence presented is not full length 
but is missing the 5' and 3' non-translated regions. The percent identities of the Glycine 
max nucleotide sequence and its encoded amino acid sequence to that of other CPP 

15 sequences as determined by ClustalW analysis are shown in Figure 2. 

Using the sequences disclosed herein as hybridization probes, one is able to 
screen and isolate full length sequences from cDNA or genomic libraries or use the rapid 
amplification of cDNA ends (RACE) technology or other such PCR techniques. 
The CPP nucleic acids and amino acids disclosed above have homology to other 

20 disclosed CPP sequences (GenBank ID NOs: AL161491 (AT4g01320), AF007269 and 
AF353722; WO 02/16625 A2 ). The homology between these and other sequences is 
shown in the ClustalW alignment analysis shown in Tables 6A-6B. 



Table 6A. ClustalW Nucleic Acid Analysis of CaaX Prenyl Protease 

25 

1 : PPI-AtCPP SEQ ID NO: 1 

2: PPI-BnCPP SEQ ID NO: 14 

3 : PPI-GmCPP SEQ ID NO: 1 7 

4: BASF AT1 SEQ ID NO:21 
30 5: BASF AT2 SEQ ID NO:23 

6: BASF-Corn SEQ ID NO:25 

7: BASF-Gm SEQ ID NO:27 

8:AFC1 SEQIDNO:29 

9: AT4g01320 SEQ ID NO:31 
35 10: AF007269 SEQ ID NO:33 

CLUSTAL W (1.81) multiple sequence alignment 
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PPI-GmCPP 

BASF-Gm 

AT4g01320 

AF0072 69 ATGGCGATTCCTTTCATGGAAACCGTCGTGGGTAAGCTTCAAAACCTTTTTCTGAGACAT 

5 PPI-AtCPP 

BASF_AT2 

afcl 

BASF_AT1 

PPI-BnCPP 

10 BASF-Corn 



PPI-GmCPP 

BASF-Gm 

15 AT4g01320 

AF0072 69 TTTACTATCCTGTTTCACTCATCGTATTTCGTTTTTGTTTGGGTTTTGCTTTCTGTGTTG 

PPI-AtCPP 

BASF_AT2 

afcl 

20 BASF_AT1 

PPI-BnCPP 

BASF-Corn 



25 PPI-GmCPP 

BASF-Gm 

AT4g01320 

AF0072 69 TGTGTGTTGAGATTCCATGACTCGTTTGTTTCATATACCATCGTCTCTGCTTCTCGTTTC 

PPI-AtCPP 

30 BASF_AT2 

afcl 

BAS F_AT 1 

PPI-BnCPP 

BASF-Corn 

35 

PPI-GmCPP 

BASF-Gm 

AT4g01320 

40 AFO 072 69 TAAATTTTGTTCTTTTCTAATAGTGCGTACCTTGATCTGAGGTTTTATTACTCCTACTAG 

PPI-AtCPP 

BASF_AT2 

afcl 

BAS F_AT 1 

45 PPI-BnCPP 

BASF-Corn 



50 PPI-GmCPP 

BASF-Gm 

AT4g01320 

AF007 2 69 TTTCTTGTCTTACTCGTGCGTTTGATTTGATTTGAGCTTATGTGATTTCATCATCTCTTC 

PPI-AtCPP 

55 BASF_AT2 

afcl 

BASF_AT1 

PPI-BnCPP 

BASF-Corn 

60 

PPI-GmCPP 

BASF-Gm 

AT4g01320 

65 AFO 07 2 69 CTCGGTTTTAGAATGTACGGAGCTTCTCTGTTAACCAAAATCTAGGATTTGGGAAGAAAA 

PPI-AtCPP 

BASF__AT2 

afcl 



65 



WO 03/012116 



PCT/IB02/03887 



BASF_AT1 

PPI-BnCPP 

BASF-Corn 

5 

PPI-GmCPP 

BASF-Gm 

AT4g01320 

AF0072 69 GTCGGAGTCTTTTTTTTCCTCATTCCCGATTGGAAATTGAGAATCTTGAAATTTTTCTTT 

10 PPI-AtCPP 

BASF_AT2 

afcl 

BAS F_AT 1 

PPI-BnCPP 

15 BASF-Corn 



20 



25 



30 



35 



40 



45 



50 



PPI-GmCPP 

BASF-Gm 

AT4g01320 

AF007269 

PPI-AtCPP 

BASF_AT2 

afcl 

BAS F_AT 1 

PPI-BnCPP 

BASF-Corn 

PPI-GmCPP 

BASF-Gm 

AT4g01320 

AF007269 

PPI-AtCPP 

BASF_AT2 

afcl 

BAS F_AT 1 

PPI-BnCPP 

BASF-Corn 



PPI-GmCPP 

BASF-Gm 

AT4g01320 

AF007269 

PPI-AtCPP 

BASF_AT2 

afcl 

BAS F_AT 1 

PPI-BnCPP 

BASF-Corn 



CTAATACGACTCACTATAGGGC 

GTTCAAGTCATACAGCTTGAGGTTTTGGGTTTTCTTGTCAGGGTATTATTATGTTCGTGA 

AAGCAGTGGTAACAACGCAGAGTACGCGGGGGGAGACGCATGGTTCTGAACTAATTGTTA 
CTGCAACTAGAGTTTTCTGGAGTTTTTTGAAATGGGTTTTGTGTTGTGGAACCGTATGTG 

TAAATAATACCTAAAATTTTGAGTTGTCCTAAACATTGGGGTTTAAACAAATCCAATCTC 
AATGTTGCATCAAAACTCTTTCAGTGCTCCAATGTTTCCATCAGTAGTCAGCACAAGAGA 



PPI-GmCPP 



BASF-Gm TCAATATAAAACCCAATGATCTCACC — CTCACTCCGTTTCTGATTTCTCACTCTTCGTT 

55 AT4g01320 

AF0 07269 T C T T T T TAT AT C T GG T T GAT C AAAAAAG T AG AT GAT G T T AT T G AAT T T T C AG T G AT GG AG 

PPI-AtCPP 

BASF_AT2 

afcl 

60 BAS F_AT 1 

PPI-BnCPP 

BASF-Corn 

PPI-GmCPP ATGGCGTTTCCC — TACATGGAAGCCG 

65 BASF-Gm TCTCGTTCGGTTCATCAGCGTGTGTCTCAGC-CATGGCGTTTCCC — TACATGGAAGCCG 

AT4g01320 ATGGCGATTCCT — TTCATGGAAACCG 

AF0072 69 TATCTGTTGTTGTGGCATTTAGAGTAGATTCGTATTTCATCTTCTGTTTTATTCTTTTTC 

PPI-AtCPP ATGGCGATTCCT — TTCATGGAAACCG 
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BASF_AT2 ATGGCGATTCCT — TTCATGGAAM^^ 

afcl ATGGCGATTCCT — TTCATGGAAACCG 

BASF_AT1 ATGGCGATTCCT — TTCATGGAAACCG 

PPI-BnCPP ATGGCGATTCCT — TTCATGGAAACCG 

5 BASF-Corn 

PPI-GmCPP TTGTCGGATTTATGATATTAATGTACATTTTTGAAACTTACTTGGATGTGCGACAACATA 

BASF-Gm TTGTCGGATTTATGATATTAATGTACATTTTTGAAACTTACTTGGATGTGCGACAACATA 

AT4g0132 0 TCGTGGGTTTTATGATAGTGATGTACATTTTTGAGACGTATTTGGATCTGAGGCAACTCA 

1 0 AFO 07269 TTACAGGTTTTATGATAGTGATGT ACATTTTTGAGACGT AT T T GG AT CTGAGGCAACTC A 

PPI-AtCPP TCGTGGGTTTTATGATAGTGATGTACATTTTTGAGACGTATTTGGATCTGAGGCAACTCA 

BASF_AT2 TCGTGGGTTTTATGATAGTGATGTACATTTTTGAGACGTATTTGGATCTGAGGCAACTCA 

afcl TCGTGGGTTTTATGATAGTGATGTACATTTTTGAGACGTATTTGGATCTGAGGCAACTCA 

B AS F_AT 1 TCGTGGGTTTTATGATAGTGATGTACATTTTTGAGACGTATTTGGATCTGAGGCAACTCA 

15 PPI-BnCPP TCGTTGGTTTTATGATAGTGATGTACGTTTTTGAGACGTATTTGGATCTGAGGCAACATA 

BASF-Corn 

PPI-GmCPP GGGCCCTCAAACTTCCTACTCTTCCAAAGACTTTAGAGGGTGTTATCAGCCAAGAGAAAT 

BASF-Gm GGGCCCTCAAACTTCCTACTCTTCCAAAGACTTTAGAAGGTGTTATCAGCCAAGAGAAAT 

20 AT4g01320 CTGCTCTCAAGCTTCCAACTCTCCCGAAAACCTTGGTTGGTGTAATTAGCCAAGAGAAGT 

AF0072 69 CTGCTCTCAAGCTTCCAACTCTCCCGAAAACCTTGGTTGGTGTAATTAGCCAAGAGAAGT 

PPI-AtCPP CTGCTCTCAAGCTTCCAACTCTCCCGAAAACCTTGGTTGGTGTAATTAGCCAAGAGAAGT 

BASF_AT2 CTGCTCTCAAGCTTCCAACTCTCCCGAAAACCTTGGTTGGTGTAATTAGCCAAGAGAAGT 

afcl CTGCTCTCAAGCTTCCAACTCTCCCGAAAACCTTGGTTGGTGTAATTAGCCAAGAGAAGT 

25 BAS F_AT 1 CTGCTCTCAAGCTTCCAACTCTCCCGAAAACCTTGGTTGGTGTAATTAGCCAAGAGAAGT 

PPI-BnCPP CTGCTCTCAAGCTTCCCACTCTCCCAAAGACTTTGGTTGGAGTCATTAGCCAAGAGAAGT 

BASF-Corn 

30 PPI-GmCPP T T G AG AAAT C TAG AG C C TAT AG 

BASF-Gm T T GAG AAAT C TAG AG C C TAT AG 

AT4g01320 TTGAGAAATCACGAGCATACAG 

AF0072 69 T T GAG AAAT C AC G AG C AT AC AGT C T T G AC AAAAG GT TTCGTCTT GAT CAT AT T TAT AT C A 

PPI-AtCPP TTGAGAAATCACGAGCATACAG 

35 BASF_AT2 TTGAGAAATCACGAGCATACAG 

afcl T T GAG AAAT C ACGAGC AT AC AG 

BAS F_AT 1 TTGAGAAATCACGAGCATACAG 

PPI-BnCPP TTGAGAAATCTCGAGCTTACAG 

BASF-Corn 

40 

PPI-GmCPP TCTTGATAAA AGCCA 

BASF-Gm TCTTGATAAA AGCCA 

AT4g01320 GGATATCATCACTGAGAACTTTAATATATGCAGCTA 

45 AF0072 69 TTTTAGTTTTTTATAATTGCCAGGGGATATCATCACTGAGAACTTTAATATATGCAGCTA 

PPI-AtCPP T CT T G AC AAA AGCTA 

BASF_AT2 TCTTGACAAA AGCTA 

afcl TCTTGACAAA AGCTA 

BAS F_AT 1 TCTT G AC AAA AGCTA 

50 PPI-BnCPP TCTTGACAAA AGCCA 

BASF-Corn 

PPI-GmCPP CTTCCATTTTGTTCACGAGTTTGTGACAATAGTGACAGACTCTACAATTTTGTACTTTGG 

BASF-Gm CTTCCATTTTGTTCACGAGTTTGTGACAATAGTGACAGACTCTACAATTTTGTACTTTGG 

55 AT4g01320 TTTTCACTTTGTTCATGAGTTTGTAACTATACTTATGGACTCTGCAATTTTGTTCTTTGG 

AF007 269 TTTTCACTTTGTTCATGAGTTTGTAACTATACTTATGGACTCTGCAATTTTGTTCTTTGG 

PPI-AtCPP TTTTCACTTTGTTCATGAGTTTGTAACTATACTTATGGACTCTGCAATTTTGTTCTTTGG 

BASF_AT2 TTTTCACTTTGTTCATGAGTTTGTAACTATACTTATGGACTCTGCAATTTTGTTCTTTGG 

afcl TTTTCACTTTGTTCATGAGTTTGTAACTATACTTATGGACTCTGCAATTTTGTTCTTTGG 

60 BASF_AT1 TTTTCACTTTGTTCATGAGTTTGTAACTATACTTATGGACTCTGCAATTTTGTTCTTTGG 

PPI-BnCPP TTTTCACTTTGTTCATGAGTTTGTTACTATACTTATGGACTCTGCGATTCTGTTCTTTGG 

BASF-Corn 

PPI-GmCPP GGTATTGCCCTGGTTTTGGAAG 

65 BASF-Gm GGTATTGCCCTGGTTTTGGAAG 

AT4g01320 GATCTTGCCTTGGTTTTGGAAG 

AFO 072 69 GATCTTGCCTTGGTTTTGGAAGGTACATATCTGGTTTCGGTATACAGTATCTCATTTTGA 

PPI-AtCPP GATCTTGCCTTGGTTTTGGAAG 
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BASF_AT2 
afcl 

BAS F_AT 1 
PPI-BnCPP 
5 BASF-Corn 

PPI-GmCPP 
BASF-Gm 
AT4g01320 
AF007269 
PPI-AtCPP 
BASF_AT2 
afcl 

BAS F_AT 1 
PPI-BnCPP 
BASF-Corn 
PPI-GmCPP 
BASF-Gm 
AT4g01320 
AF007269 
PPI-AtCPP 
BASF_AT2 
afcl 

BAS F_AT 1 
PPI-BnCPP 
BASF-Corn 

PPI-GmCPP 
BASF-Gm 
30 AT4g01320 
AF007269 
PPI-AtCPP 
BASF_AT2 
afcl 
35 BAS F_AT 1 
PPI-BnCPP 
BASF-Corn 



40 PPI-GmCPP 
BASF-Gm 
AT4g01320 
AF007269 
PPI-AtCPP 

45 BASF_AT2 
afcl 

BAS F__AT 1 

PPI-BnCPP 

BASF-Corn 

50 

PPI-GmCPP 
BASF-Gm 
AT4g01320 
AF007269 
55 PPI-AtCPP 
BASF_AT2 
afcl 

BAS F_AT 1 
PPI-BnCPP 
60 BASF-Corn 



PPI-GmCPP 
BASF-Gm 
65 AT4g01320 
AF007269 
PPI-AtCPP 
BASF AT 2 



15 



20 



GATCTTGCCTTGGTTTTGGAAG- 
GATCTTGCCTTGGTTTTGGAAG 
GATCTTGCCTTGGTTTTGGAAG 
GATCTTGCCTTGGTTTTGGAAG 



AAATCAGGAGAT 

AAATCAGGAGAT 

ATGTCTGGAGCT 

ATATAGAGTTGTTACATTACAATTGTAAAGTTTTCATTTTTACCTTAGATGTCTGGAGCT 

ATGTCTGGAGCT 

ATGTCTGGAGCA 

ATGTCTGGAGCT 

ATGTCTGGAGCT 

ATATCTGGCGGC 



TTTATGACAATAGCTGGTTTCAATGCTGAGAATGAAATACTGCATACCCTTGCCTTCTTA 
TTTATGACAATAGCTGGTTTCAATGCTGAGAATGAAATACTGCATACCCTTGCCTTCTTA 
GTTTTACCGAGGTTGGGCCTTGATCCAGAGAATGAAATACTGCATACTCTTTCATTCTTG 
GTTTTACCGAGGTTGGGCCTTGATCCAGAGAATGAAATACTGCATACTCTTTCATTCTTG 
GTTTTACCGAGGTTGGGCCTTGATCCGGAGAATGAAATACTGCATACTCTTTCATTCTTG 
GTTTTACCGAGGTTGGGCCTTGATCCAGAGAATGAAATACTGCATACTCTTTCATTCTTG 
GTTTTACCGAGGTTGGGCCTTGATCCAGAGAATGAAATACTGCATACTCTTTCATTCTTG 
GTTTTACCGAGGTTGGGCCTTGATCCAGAGAATGAAATACTGCATACTCTTTCATTCTTG 
TTTCTACCAATGGTGGGACTCGATCCAGAGAATGAAATCCTGCACACTCTTTCATTCTTG 

AC G AGG CT GAG T G C T G AG AAT G AG AT AAT AC AC AC CCTTGCTTTCTTA 

* * * ★ ******** ** * *★ *★ *** * ***** 

G C AG GG C T GAT GAT T T G G T C AC AG " 

G C AGG G C T GAT GAT T T G GT C AC AG 

GCTGGTGTTATGACATGGTCACAG 

GCTGGTGTTATGACATGGTCACAGGTGTTCCAAATAAACCCCTTCATATAGTCCTATACG 

GCTGGTGTTATGACATGGTCACAG 

GCTGGTGTTATGACATGGTCACAG 

GCTGGTGTTATGACATGGTCACAG 

GCTGGTGTTATGACATGGTCACAC 

GCTGGTCTTATGACATGGTCACAG 

GCTGGTTCCATGGTTTGGTCGCAG 

** ** *** ***** ** 



T T T AG CAT C AAAAT AT CT AT T T T C T T AAG AT AAT AAT AT T T C T T T TAT AT T C T GAT G C AG 



ATAACAGATTTGCCCTTTTCTCTGTACTCAACTTTTGTGATTGAGGCCCGTCATGGTTTT 
ATAACAGATTTGCCCTTTTCTCTGTACTCAACTTTTGTGATTGAGGCCCGTCATGGTTTT 
ATCACTGATTTGCCATTTTCTTTGTACTCAACTTTCGTGATCGAGTCTCGGCATGGGTTC 
ATCACTGATTTGCCATTTTCTTTGTACTCAACTTTCGTGATCGAGTCTCGGCATGGGTTC 
ATCACTGATTTGCCATTTTCTTTGTACTCAACTTTCGTGATCGAGTCTCGGCATGGGTTC 
ATCACTGATTTGCCATTTTCTTTGTACTCAACTTTCGTGATCGAGTCTCGGCATGGGTTC 
ATCACTGATTTGCCATTTTCTTTGTACTCAACTTTCGTGATCGAGTCTCGGCATGGGTTC 
ATCACTGATTTGCCATTTTCTTTGTACTCAACTTTCGTGATCGAGTCTCGGCATGGGTTC 
ATCACTGATTTGCCATTTTCTTTGTACTCAACTTTCGTGATCGAGTCTCGGCATGGGTTC 
ATTACAGACTTGCCGTTCTCTCTCTATTCAACTTTTGTTATAGAGGCTCGACATGGTTTT 
** ** ★★ ***** ** *** * ** ******** ** ** *** * ** ***** ** 

AATAAG 

AATAAG 

AACAAA 

AACAAAGTATGTCGTATTTCCAACACTACCTTGTGACTTACGTTTTTTTATCAGAGATGT 

AACAAA 

AACAAA 
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afd AACAAA 

BAS F_AT 1 AACAAA 

PPI-BnCPP AACAAA 

BASF-Corn AACAAG 

5 * * * * 

PPI-GmCPP CAAACACCATGGTTATTCTTTAGGGACA 

BASF-Gm CAAACACCATGGTTATTCTTTAGGGACA 

AT4g01320 C AAAC AAT AT G G AT G T T CAT TAG G G AC A 

AF007 269 GG AT T AAAT T TGCT T CT AAATT CTGTT G AC AGC AAAC AAT AT GG AT GTTC AT TAGGG AC A 

10 PPI-AtCPP C AAAC AAT AT GG AT GT T C ATT AGGG AC A 

BASF_AT2 C AAAC AAT ATGGAT GTTC ATT AGGG AC A 

afcl CAAACAATATGGATGTTCATTAGGGACA 

BAS F_ ATI CAAACAATATGGATGTTCATTAGGGACA 

PPI-BnCPP CAAACAATATGGATGTTCATTAGGGACA 

15 BASF-Corn CAAACTATATGGCTCTTCATTAGGGATA 

**** ★ *** * 

PPI-GmCPP TGCTTAAAGGAATTTTCCTTTCTGTAATAATTGGTCCACCTATTGTGGCTGCAATCATTG 

BASF-Gm TGCTTAAAGGAATTTTCCTTTCCGTAATAATTGGTCCACCTATTGTGGCTGCAATCATTG 

20 AT4g01320 TGATCAAAGGAACATTCCTCTCTGTCATACTAGGCCCACCCATTGTTGCTGCGATAATTT 

AF0072 69 TGATCAAAGGAACATTCCTCTCTGTCATACTAGGCCCACCCATTGTTGCTGCGATAATTT 

PPI-AtCPP TGATCAAAGGAACATTCCTCTCTGTCATACTAGGCCCACCCATTGTTGCTGCGATAATTT 

BASF_AT2 TGATCAAAGGAACATTCCTCTCTGTCATACTAGGCCCACCCATTGTTGCTGCGATAATTT 

afcl TGATCAAAGGAACATTCCTCTCTGTCATACTAGGCCCACCCATTGTTGCTGCGATAATTT 

25 BAS F_AT 1 T GAT C AAAGG AAC AT TCCTCTCTGT CAT AC TAG G C C C AC C CAT TGTTGCCGC GAT AAT T T 

PPI-BnCPP TGATCAAAGGAATACTCCTCTCTGTCATACCTGCCCCTCCTATCGTTGCCGCAATTATTG 

BASF-Corn TGATCAAAGGAATTTTACTATCCATGATATTGGGGCCACCAATCGTGGCTGCTATCATCT 
*■ * * ★ ★* ** * ★ *■ ★ * *★ * * ** *★ ★* ★* 

30 PPI-GmCPP TAATAGTACAG 

BASF-Gm TAATAGTACAG 

AT4g01320 TCATAGTCCAG 

AF007 2 69 TCATAGTCCAGGTTTGATGATTCTGGATTCATCTTATTTCTGAGTTTTTCACATGGATGA 

PPI-AtCPP TCATAGTCCAG 

35 BASF_AT2 TCATAGTCCAG 

afcl TCATAGTCCAG 

BAS F_AT 1 TCATAGTCCAG 

PPI-BnCPP TTATAGTTCAG 

BASF-Corn ACATAGTACAG 

PPI-GmCPP 

BASF-Gm 

AT4g01320 

AF007 2 69 CTATTCTCCATTGAGTGTGAGCTTCAAAGTTTTTAGTTTTCGTGTTAAAAATTTAAAATT 

45 PPI-AtCPP 

BASF_AT2 

afcl 

BAS F_AT 1 

PPI-BnCPP 

50 BASF-Corn 

PPI-GmCPP AAAGGAGGTCCATACTTGGCCATC 

BASF-Gm AAAGGAGGTCCATACTTGGCCATC 

AT4g01320 AAAGGAGGTCCTTATCTTGCCATC 

55 AF0072 69 TGCTTCTCTGAGCATGAAGTTTCTATCTTTTTCCAGAAAGGAGGTCCTTATCTTGCCATC 

PPI-AtCPP AAAGGAGGTCCTTATCTTGCCATC 

BAS F_AT2 AAAGGAGGTCCTTATCTTGCCATC 

afcl AAAGGAGGTCCTTATCTTGCCATC 

BAS F_AT 1 AAAGGAGGTCCTTATCTTGCCATC 

60 PPI-BnCPP AAAGGAGGTCCTTACCTCGCCATC 

BASF-Corn ATTGGAGGACCTTACCTGGCTATA 

★ *•** + ★ ** ** * ** * * 

PPI-GmCPP TATCTTTGGGTTTTTACGTTTGGTCTTTCTATTGTGATGATGACCCTTTATCCAGTACTA 

BASF-Gm TATCTTTGGGTTTTTACGTTTGGTCTTTCTATTGTGATGATGACCCTTTATCCAGTACTA 

65 AT4g01320 TATCTGTGGGCATTCATGTTTATCCTGTCTCTAGTGATGATGACTATATACCCGGTCTTG 

AF0072 69 TATCTGTGGGCATTCATGTTTATCCTGTCTCTAGTGATGATGACTATATACCCGGTCTTG 

PPI-AtCPP TATCTGTGGGCATTCATGTTTATCCTGTCTCTAGTGATGATGACTATATACCCGGTCTTG 

BASF AT 2 TATCTGTGGGCATTCATGTTTATCCTGTCTCTAGTGATGATGACTATATACCCGGTCTTG 
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afcl 

BASF_AT1 

PPI-BnCPP 

BASF-Corn 

5 

PPI-GmCPP 
BASF-Gm 
AT4g01320 
10 AF007269 
PPI-AtCPP 
BASF_AT2 
afcl 

BASF_AT1 
15 PPI-BnCPP 
BASF-Corn 

PPI-GmCPP 
BASF-Gm 
AT4g01320 
AF007269 
PPI-AtCPP 
BASF_AT2 
afcl 

BASF_AT1 
PPI-BnCPP 
BASF-Corn 

PPI-GmCPP 
30 BASF-Gm 

AT4g01320 
AF007269 
PPI-AtCPP 
BASF_AT2 
35 afcl 

BAS F_AT 1 

PPI-BnCPP 

BASF-Corn 

40 

PPI-GmCPP 
BASF-Gm 
AT4g01320 
AF007269 
45 PPI-AtCPP 
BASF_AT2 
afcl 

BASF_AT1 
PPI-BnCPP 
50 BASF-Corn 

PPI-GmCPP 
BASF-Gm 
AT4g01320 
AF007269 
PPI-AtCPP 
BAS F_ AT 2 
afcl 

BAS F_AT 1 
PPI-BnCPP 
BASF-Corn 

PPI-GmCPP 
BASF-Gm 
65 AT4g01320 
AF007269 
PPI-AtCPP 
BASF AT 2 



20 



25 



55 



60 



TATCTGTGGGCATTCATGTTTATCCTGTCTCTAGTGATGATGACTATATACCCGGTCTTG 
TATCTGTGGGCATTCATGTTTATCCTGTCTCTAGTGATGATGACTATATACCCGGTCTTG 
TATCTGTGGGCATTCATGTTTATCCTGTCTCTAGTGATGATGACTATATACCCTGTTTTG 
TATCTCTGGGGTTTTATGTTTGTATTAGCTCTACTGATGATGACAATATACCCCATTGTG 
***** **** ** * **** * ★* * ********** * ** ** * * 



ATAGCTCCACTCTTCAATAAGTTCACTCCA 

ATAGCTCCACTCTTCAATAAGTTCACTCCA 

ATAGCACCGCTCTTCAACAAGTTCACTCCT 

ATAGCACCGCTCTTCAACAAGTTCACTCCTGTGTGTATTTCTGTCATGGCCATTTTACAA 

ATAGCACCGCTCTTCAACAAATTCACTCCT 

ATAGCACCGCTCTTCAACAAGTTCACTCCT 

ATAGCACCGCTCTTCAACAAGTTCACTCCT 

ATAGCACCGCTCTTCAACAAGTTCACTCCT 

ATTGCACCTCTTTTCAACAAGTTCACTCCT 

ATAGCTCCTCTGTTCAACAAGTTCACTCCT 

** ** ** ** ***** ** ******** 



TTCACTGCTTGTTTGCATATGTTGTTACCAGACAATATAATCTCCCGCTTTTTTATGGCT 



CTTCCAGATGGTCAACTCAGGGAGAAAATCGAGAAACTTGCTTCCTCCCTCAACTA 

CTTCCAGATGGTCAACTCAGGGAGAAAATCGAGAAACTTGCTTCCTCCCTCAACTA 

CTTCCAGATGGAGACCTCCGGGAGAAGATTGAGAAACTTGCTTCTTCTCTAAAGTT 

ATAGCTTCCAGATGGAGACCTCCGGGAGAAGATTGAGAAACTTGCTTCTTCTCTAAAGTT 

CTTCCAGATGGAGACCTCCGGGAGAAGATTGAGAAACTTGCTTCTTCCCTAAAGTT 

CTTCCAGATGGAGACCTCCGGGAGAAGATTGAGAAACTTGCTTCTTCTCTAAAGTT 

CTTCCAGATGGAGACCTCCGGGAGAAGATTGAGAAACTTGCTTCTTCTCTAAAGTT 

CTTCCAGATGGAGACCTCCGGGAGAAGATTGAGAAACTTGCTTCTTCTCTAAAGTT 

CTTCCTGATGGAGACCTCCGGGAGAAGATTGAGAAACTTGCTTCTTCTCTAAAGTT 

CTTCCTGAAGGAGTCCTCAGGGAAAAAATAGAGAAGCTGGCAGCTTCCCTCAAGTT 

***** ** ** *** **** ** *★ ***** ** ** * ** ** ** * 

T C C G T T AAAG AAAC TAT TTGTTGTC GAT G GAT C C AC AAG AT C AAG T C AC AG C AAT G 

TCCGTTAAAGAAACTATTTGTTGTCGATGGATCCACAAGATCAAGTCACAGCAATG 

TCCTTTGAAGAAGCTGTTTGTTGTCGATGGATCTACAAGGTCAAGCCATAGCAATG 

TCCTTTGAAGAAGCTGTTTGTTGTCGATGGATCTACAAGGTCAAGCCATAGCAATGTGAG 

TCCTTTGAAGAAGCTGTTTGTTGTCGATGGATCTACAAGGTCAAGCCATAGCAATG 

TCCTTTGAAGAAGCTGTTTGTTGTCGATGGATCTACAAGGTCAAGCCATAGCAATG 

TCCTTTGAAGAAGCTGTTTGTTGTCGATGGATCTACAAGGTCAAGCCATAGCAATG 

TCCTTTGAAGAAGCTGTTTGTTGTCGATGGATCTACAAGGTCAAGCCATAGCAATG 

TCCTCTGAAGAAGCTGTTTGTTGTCGATGGATCTACAAGGTCAAGCCATAGTAATG 

TCCTTTGAAAAAGCTTTTCGTGGTAGATGGGTCTACCAGATCAAGCCACAGTAATG 

*** * ** ** ** ** ** ** ***** ** ★* ** ***** ** ** *★** 



AAGCTTGAGATCTCTTCCTACCTACTTTACTCTAGTTTACCATTAGAAGCTTACGTATCT 



C C TAT AT GT AT G G AT T C T T C AAG AAC AAG AG GAT TGTCCCTT AT 

CCTATATGTATGGATTCTTCAAGAACAAGAGGATTGTCCTTTAT 

CTTACATGTATGGTTTCTTTAAGAACAAAAGGATTGTTCTTTAT 

TGTTACATCATACAGGCTTACATGTATGGTTTCTTTAAGAACAAAAGGATTGTTCTTTAT 

CTTACATGTATGGTTTCTTTAAGAACAAAAGGATTGTTCTTTAT 

CTTACATGTATGGTTTCTTTAAGAACAAAAGGATTGTTCTTTAT 
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af cl CTTACATGTATGGTTTCTTTAAGAACAAAAGGATTGTTCTTTAT 

BAS F_AT 1 CTTACATGTATGGTTTCTTTAAGAACAAAAGGATTGTTCTTTAT 

PPI-BnCPP CTTACATGTATGGTTTCTTCAAGAACAAAAGGATTGTTCTTTAT 

BASF-Corn CCTACATGTATGGTTTTTTCAAGAACAAGCGCATAGTACTCTAT 

5 * ** ******** ** ** ******** * ** ** * *** 

PPI-GmCPP G AC AC AT T AAT T C AAC AG 

BAS F-Gm G AC AC AT T AAT T C AAC AG 

AT4g01320 GATACGTTGATTCAGCAG 

AFO 07269 GATACGTTGATTCAGCAGGT ACTGTGACTCTTGATGCTTCAAACGAGCT AT ACT CACATT 

10 PPI-AtCPP GATACGTTGATTCAGCAG 

BAS F_AT 2 GAT ACGTTGATT CAGC AG 

afcl GATACGTTGATTCAGCAG 

BAS F_AT1 GATACGTTGATTCAGCAG 

PPI-BnCPP G AC AC AT T GAT T CAGC AG 

15 BASF-Corn G AC AC AT T GAT T CAGC AG 

** ** ** ***** *** 

PPI-GmCPP TGCAAAGACGATGAGG 

BAS F-Gm TGCAAAGACGATGAGG 

20 AT4g01320 TGCAAGAATGAGGATG 

AF007 269 TCTGTTTCTGGTTCTGAAACATAACATAATCTTCTATTGTGCAGTGCAAGAATGAGGATG 

PPI-AtCPP TGCAAGAATGAGGATG 

BAS F_AT 2 T GCAAGAAT G AGGAT G 

afcl TGCAAGAATGAGGATG 

25 BAS F_AT 1 TGCAAGAATGAGGATG 

PPI-BnCPP TGCCAGAATGAGAATG 

BAS F-Corn TGT AGC AATG AGG ATG 

* * * * * * * 

30 PPI-GmCPP AAATTGTTGCTGTTATTGCCCATGAGTTGGGACACTGGAAGCTCAACCATACTGTGTACA 

BAS F-Gm AAATTGTTGCTGTTATTGCCCATGAGTTGGGACACTGGAAGCTCAACCATACTGTGTACA 

AT4g0132 0 AAATTGTGGCGGTTATTGCACACGAGCTTGGACATTGGAAACTGAATCACACTACATACT 

AFO 072 69 AAATTGTGGCGGTTATTGCACACGAGCTTGGACATTGGAAACTGAATCACACTACATACT 

PPI-AtCPP AAATTGTGGCGGTTATTGCACACGAGCTTGGACATTGGAAACTGAATCACACTACATACT 

35 BASF_AT2 AAATTGTGGCGGTTATTGCACACGAGCTTGGACATTGGAAACTGAATCACACTACATACT 

afcl AAATTGTGGCGGTTATTGCACACGAGCTTGGACATTGGAAACTGAATCACACTAC AT ACT 

BAS F_AT 1 AAATTGTGGCGGTTATTGCACACGAGCTTGGACATTGGAAACTGAATCACACTACATACT 

PPI-BnCPP AAATTGTGGCGGTTATTGCACACGAGCTGGGACACTGGAAGCTGAATCACACTACATACT 

BASF-Corn AGATAGTTTCTGTTATAGCACATGAACTTGGACACTGGAAACTCAATCATACTGTCTATT 

4Q * ★* ** * ***** ** ** ** * ***** ***** ** ★* ** *** ** 

PPI-GmCPP CATTTGTTGCTATGCAG 

BAS F-Gm CATTTGTTGCTATGCAG 

AT4g01320 CGTTCATTGCAGTTCAA 

AFO 07269 CGTTCATTGCAGTTCAAGTGAGGCTC AACCGACAGTTCAAAAACTT ACTCACATCTACAT 

45 PPI-AtCPP CGTTCATTGCAGTTCAA 

BASF_AT2 CGTTCATTGCAGTTCAA 

afcl CGTTCATTGCAGTTCAA 

BASF_AT1 CGTTCATTGCAGTTCAA 

PPI-BnCPP CGTTCATTGCTGTTCAA 

50 BASF-Corn CCTTTGTAGCTGTCCAG 

**★ *** ★** 

PPI-GmCPP ATTCTTACA 

BAS F-Gm ATTCTTACA 

AT4g01320 ATCCTTGCC 

55 AF007 2 69 TTCACTTAAGAAATCATGTCTTATGACCCTCTCTCAATGTTTTGCTTGCAGATCCTTGCC 

PPI-AtCPP ATCCTTGCC 

BASF_AT2 ATCCTTGCC 

afcl ATCCTTGCC 

BAS F_AT 1 ATCCTTGCC 

60 PPI-BnCPP ATCCTTGCC 

BASF-Corn CTGCTTATG 

★ * * * 

PPI-GmCPP C T T C T AC AAT T T G G AGG AT AT AC AC T AGT G CG AAAT T C AG C T GAT C T G TAT C G AAG C T T T 

65 BAS F-Gm C T T CT AC AAT T T G G AGG AT AT AC AC T AGT G C G AAAT T C AG C T GAT CT G T AT C G AAG C T T T 

AT4g01320 TTCTTACAATTTGGAGGATACACTCTTGTCAGAAACTCCACTGATCTCTTCAGGAGTTTC 

AF0072 69 TTCTTACAATTTGGAGGATACACTCTTGTCAGAAACTCCACTGATCTCTTCAGGAGTTTC 

PPI-AtCPP TTCTTACAATTTGGAGGATACACTCTTCTCAGAAACTCCACTGATCTCTTCAGGAGTTTC 
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BASF_AT2 TTCTTACAATTTGGAGGATACACTCTTGTCAGAAACTCCACTGATCTCTTCAGGAGTTTC 
af cl TTCTTACAATTTGGAGGATACACTCTTGTCAGAAACTCCACTGATCTCTTCAGGAGTTTC 
BASF_AT1 TTCTTACAATTTGGAGGATACACTCTTGTCAGAAACTCCACTGATCTCTTCAGGAGTTTC 
PPI-BnCPP TTCTTGCAATTTGGAGGATACACTCTTGTCAGAAACTCCACTGATCTCTTCAGGAGTTTT 
BASF-Corn TTTCTTCAATTTGGAGGATATACTCTAGTAAGGAGCTCCAAAGATCTATTTGGAAGTTTT 
* * ************** ** ** * * * ** ***** * * ★ * ** 



10 



15 



PPI-GmCPP 

BASF-Gm 

AT4g01320 

AF007269 

PPI-AtCPP 

BASF_AT2 

afcl 

BAS F_AT 1 

PPI-BnCPP 

BASF-Corn 



GGGTTTGATACGCAGCCAGTCCTCATTGGGCTCATCATATTTCAG 

GGGTTTGATACGCAGCCAGTCCTCATTGGGCTCATCATATTTCAG 

G GAT T T GAT AC AC AG CCTGTTCT CAT T GG T T T G AT CAT AT T T C AG 

GGATTTGATACACAGCCTGTTCTCATTGGTTTGATCATATTTCAGGTTTGTTATTTTTGC 

GGATTTGATACACAGCCTGTTCTCATTGGTTTGATCATATTTCAG 

GGATTTGATACACAGCCTGTTCTCATTGGTTTGATCATATTTCAG 

GGATTTGATACACAGCCTGTTCTCATTGGTTTGATCATATTTCAG 

GGATTTGATACACAGCCTGTTCTCATTGGTTTGATCATATTTCAG 

GGTTTTGATACACAACCAGTTCTCATTGGTTTGATCATATTTCAG 

GGCTTCAAGGACCAGCCAGTAATAATTGGATTGATCATTTTCCCG 

** ** * ** ** ** * ***** * ***** ★ * * * 



20 PPI-GmCPP 

BASF-Gm 

AT4g01320 

AFO 07 2 69 CTTTTGACACT AATCT AATGAATCAAGGATGGATTAAGAAAAAAAAACTCTAAACCTTTG 

PPI-AtCPP 

25 BAS F_ AT 2 

afcl 

BAS F_AT 1 

PPI-BnCPP 

BASF-Corn 

30 PPI-GmCPP CAT AC T G T AAT C C C AC T T C AG C AAT T G G T C AG C 

BASF-Gm CATACTGTAATCCCACTTCAGCAATTGGTCAGC 

AT4g01320 CACACTGTAATACCACTGCAACATCTAGTAAGC 

AFO 07 2 69 GTTATATCTCCTGTCTGATTATCACAGCACACTGTAATACCACTGCAACATCTAGTAAGC 

PPI-AtCPP CACACTGTAATACCACTGCAACATCTAGTAAGC 

35 BASFJVT2 CACACTGTAATACCACTGCAACATCTAGTAAGC 

afcl CACACTGTAATACCACTGCAACATCTAGTAAGC 

BAS F_ ATI CACACTGTAATACCACTGCAACATCCAGTAAGC 

PPI-BnCPP CACACTGTAATACCACTTCAACACCTAGTAAGC 

BASF-Corn CACACCATAATACCCATCCAACACCTTCTGAGC 

Af) *★ ** ****** ***** **** 
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50 



55 



60 



PPI-GmCPP 

BASF-Gm 

AT4g01320 

AF007269 

PPI-AtCPP 

BAS F_ AT 2 

afcl 

BASF_AT1 

PPI-BnCPP 

BASF-Corn 

PPI-GmCPP 

BASF-Gm 

AT4g01320 

AF007269 

PPI-AtCPP 

BASF_AT2 

afcl 

BASF_AT1 

PPI-BnCPP 

BASF-Corn 



TTTGGTCTGAACCTAGTCAGCCGATCATTTGAATTTCAGG 

TTTGGTCTGAACCTAGTCAGCCGATCATTTGAATTTCAGG 

TTTGGCCTGAACCTCGTTAGTCGAGCGTTTGAGTTTCAGG 

TTTGGCCTGAACCTCGTTAGTCGAGCGTTTGAGTTTCAGGTACCATCTTACAATCCCTCA 

TTTGGCCTGAACCTCGTTAGTCGAGCGTTTGAGTTTCAGG 

TTTGGCCTGAACCTCGTTAGTCGAGCGTTTGAGTTTCAGG 

TTTGGCCTGAACCTCGTTAGTCGAGCGTTTGAGTTTCAGG 

TTTGGCCTCAACCTTGTTAGTCGAGCGTTTGAGTTTCAGG 

TTTGACCTCAACCTTGTTAGTCGAGCGTTTGAGTTTCAGG 

TTTCGCCTGAACCTTGTCAGCAGAGCATTTGAATTTCAGG 

*** ** ***** ** ** ** * ***** ******* 

AGATCCAACCATAGTTTCTTTATTGCAATGGCAGCCTCATCTACTAATCTGAGTTAACGT 



PPI-GmCPP CTGATGGCTTTGCCAAGAAGCTTGGATATGCATCTGGATTACGCGGTG 

65 BASF-Gm CTGATGGCTTTGCCAAGAAGCTTGGATATGCATCTGGATTACGCGGTG 

AT4g01320 CTGATGCTTTTGCTGTGAAGCTTGGCT ATGCAAAAGATCTTCGTCCTG 

AF0072 69 TCCTTTTGCAGGCTGATGCTTTTGCTGTGAAGCTTGGCTATGCAAAAGATCTTCGTCCTG 

PPI-AtCPP CTGATGCTTTTGCTGTGAAGCTTGACTATGCAAAAGATCTTCGTCCTG 
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BASF_AT2 CTGATGCTTTTGCTGTGAAGCTTGGCTATGCAAAAGATCTTCGTCCTG 

afcl CTGATGCTTTTGCCGTGAAGCTTGGCTATGCAAAAGATCTTCGTCCTG 

BASF_AT1 CTGATGCTTTTGCTGTGAAGCTTGGCTATGCAAAAGATCTTCGTCCTA 

PPI-BnCPP CTGATGCTTTTGCAGTGAATCTTGGTTATGCAAAGGATCTACGTCCTG 

5 BASF-Corn CTGATGCCTTTGCCAAGAACCTTGGATATGCCCCTCAGCTCCGAGCAG 

****** ***** *** **** ***** * ** 

PPI-GmCPP GTCTTGTGAAACTACAGG 

BASF-Gm GTCTTGTGAAACTACAGG 

10 AT4g01320 CTCTAGTGAAACTACAGGTCAGAGAAGATAACAACAGAACACAAACTGTTACCTCAATTT 

AF00 7 269 CTCTAGTGAAACTACAGGTCAGAGAAGATAACAACAGAACACAAACTGTTACCTCAATTT 

PPI-AtCPP CTCTAGTGAAACTACAGG 

BASF_AT2 CTCTAGTGAAACTACAGG 

afcl CTCTAGTGAAACTACAGG 

15 BAS F__AT 1 CTCTAGTGAAACTACAGG 

PPI-BnCPP CCCTAGTGAAGCTACAGG 

BASF-Corn CCCTTGTTAAACTACAGG 

* * * * ** ******* 

PPI-GmCPP AG G AG AAT C T G T C AG C T A 

20 BASF-Gm AGGAGAATCTGTCAGCTA 

AT4g01320 GTGTCACACACTTAAATGGATTTTTTGTTGGGATTTTGCAGGAAGAGAACTTATCAGCAA 

AF007 2 69 GTGTCACACACTTAAATGGATTTTTTGTTGGGATTTTGCAGGAAGAGAACTTATCAGCAA 

PPI-AtCPP AAG AGAACTT ATC AACAA 

BASF_AT2 AAG AG AAC T TAT C AG C AA . 

25 afcl AAG AG AAC T TAT C AG C AA 

BAS F_AT 1 : AAGAGAACTTATCAGCAA 

PPI-BnCPP AAG AG AAC T TAT C AG C G A 

BASF-Corn AGGAGAACTTGTCTGCGA 

* ***** * ** * * 

30 

PPI-GmCPP TGAATACAGATCCTTGGTACTCTGCTTATCACTATTCTCATCCTCCCCTTGTTGAAAGAT 

BASF-Gm TGAATACAGATCCTTGCT--CGTGCCG 

AT4g01320 TGAACACTGATCCATTGTACTCAGCTTATCACTACTCACATCCTCCTCTTGTTGAAAGGC 

AF007 2 69 TGAACACTGATCCATTGTACTCAGCTTATCACTACTCACATCCTCCTCTTGTTGAAAGGC 

35 PPI-AtCPP TGAACACTGATCCATTGTACTCAGCTTATCACTACTCACATCCTCCTCTTGTTGAAAGGC 

BASF_AT2 TGAAAACTGATCTATTGTACTCAGCTTATCACTACTCACATCCTCCTCTTGTTGAAAGGC 

afcl TGAACACTGATCCATTGCACTCAGCTTATCACTACTCACATCCTCCTCTTGTTGAAAGGC 

BASF_AT1 TGAATACTGATCCATTGTACTCAGCTTATCACTACTCACATCCTCCTCTTGTTGAAAGGC 

PPI-BnCPP TGAACACAGACCCATTGTACTCAGCTTATCACTACTCACACCCTCCTCTTGTAGAGAGGC 

40 BASF-Corn TGAACACCGATCCTTGGTATTCGGCATATCACTACTCCCACCCACCACTCGTCGAGAGGC 

********* * ** 
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50 



PPI-GmCPP 

BASF-Gm 

AT4g01320 

AF007269 

PPI-AtCPP 

BASF_AT2 

afcl 

BAS F_AT 1 

PPI-BnCPP 

BASF-Corn 



TGGCCGCGCTGGACGA AC C G GAT AAG AAG G AAG AC T AA 

TTCGAGCCATTGATGG AG AAG AC AAG AAG AC AG AT TAA 

TTCGAGCCATTGATGG AG AAG AC AAG AAG AC AG AT TAA 

TTCGAGCCACTGATGG AG AAG AC AAG AAG AC AG AT TAA 

TTCGAGCCATTGATGG AGAAGACAAGAAGACAGATTAA 

TTCGAGCCATTGATGG AGAAGACAAGAAGACAGATTAA 

TTCGAGCCATTGATGG AGAAGACAAGAAGACAGATTAA 

TTCGAGCCATTGATGG AGAAGACAAGAAGACAGATTAA 

T G C AAG C T T T G G AAG AT T C AG AC G AC AAAAAAG AAG AT T AGT C GAT C C T T GT AT GAG G T T 



55 PPI-GmCPP T 

BASF-Gm 

AT4g01320 

AF007269 

PPI-AtCPP 

60 BAS F_ AT 2 

afcl 

BASF_AT1 

PPI-BnCPP 

BASF-Corn TACATATGGATTTTTCCCTGCCACATGCACACCGATTCAGTGCTTGGATGGTGAGGGTTT 

65 

PPI-GmCPP 

BASF-Gm 

AT4g01320 
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AF007269 

PPI-AtCPP 

BASF_AT2 

afcl 

BAS F_AT 1 

PPI-BnCPP 

BASF-Corn TGACATAGGAGTGTTGTCAAAGCTTTAGAGTGCATCTTTCGGTCAGGTGCAACAGCCTTT 



10 PPI-GmCPP 

BASF-Gm 

AT4g01320 

AF007269 

PPI-AtCPP 

15 BAS F_ AT 2 

afcl 

BAS F_AT 1 

PPI-BnCPP 

BASF-Corn CGGTCATTGAGACATATAAGCGAATTAGCTATTAAAAAAAACAGAACTGTTGCATCAAAA 

20 

PPI-GmCPP 

BASF-Gm 

AT4g01320 

25 AF007269 

PPI-AtCPP 

BASF_AT2 

afcl 

BASF_AT1 

30 PPI-BnCPP 

BASF-Corn AAAAAAAAAAAAAAGAAACAAAAAAAAAAAAAAAAAAAAAAAAGAAAAAAAAAAAAAAAA 



PPI-GmCPP 

35 BASF-Gm 

AT4g01320 

AF007269 

PPI-AtCPP 

BASF_AT2 

40 afcl 

BAS F_AT 1 

PPI-BnCPP 

BASF-Corn AAAAAGTGCTCTGCGTTGTTACCACTGCTTGCCCTATAGTGATCGTATCAGA 
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Table 6B. ClustalW Amino Acid Analysis of CaaX Prenyl Protease 



1 : PPI-AtCPP SEQ ID NO:2 

2: PPI-BnCPP SEQ ID NO: 15 

50 3: PPI-GmCPP SEQ ID NO: 18 

4: BASF_AT1 SEQ ID NO:22 

5 : BASF_AT2 SEQ ID NO:24 

6: BASF-Corn SEQ ID NO:26 

7: BASF-Gm SEQ ID NO:28 
55 8:AFC1 SEQIDNO:30 

9: AT4g01320 SEQ ID NO:32 

10: AF007269 SEQ ID NO:34 

PPI-GmCPP MAFPYMEAWGFMILMYIFETYLDVRQHRALKLPTLPKTLEG VISQEKFEKSR 

60 BASF-Gm MAFPYMEAVVGFMILMYIFETYLDVRQHRALKLPTLPKTLEG VISQEKFEKSR 

AF0072 69 MAIPFMETVVGFMIVMYIFETYLDLRQLTALKLPTLPKTLI 

AT4g-AtCPP MAIPFMETVVGFMIVMYIFETYLDLRQLTALKLPTLPKTLVGVISQEKFEKSRAYRDIIT 
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BASF_AT2 MAI PFMETVVGFMI VMYI FETYLDLRQLTALKLPTLPKTLVG VISQEKFEKSR 

AFC1 MAI PFMETVVGFMI VMYI FETYLDLRQLTALKLPTLPKTLVG VISQEKFEKSR 

BASF_AT1 MAI PFMETVVGFMI VMYI FETYLDLRQLTALKLPTLPKTLVG VISQEKFEKSR 

PPI-AtCPP MAI PFMETVVGFMI VMYI FETYLDLRQLTALKLPTLPKTLVG VISQEKFEKSR 

5 PPI-BnCPP MAIPFMETWGFMIVMYVFETYLDLRQHTALKLPTLPKTLVG VISQEKFEKSR 

BASF-Corn 



PPI-GmCPP AYSLDKSHFHFVHEFVTIVTDSTILYFGVLPWFWKKSGDFMTIAGFNAENEILHTLAFLA 

10 BASF-Gm AYSLDKSHFHFVHEFVTIVTDSTILYFGVLPWFWKKSGDFMTIAGFNAENEILHTLAFLA 

AF007269 

AT4g-AtCPP ENFNICSYFHFVHE FVTILMDSAILFFGILPWFWKMSGAVLPRLGLDPENEILHTLSFLA 

BASF_AT2 AYSLDKSYFHFVHEFVTILMDSAILFFGILPWFWKMSGAVLPRLGLDPENEILHTLSFLA 

AFC1 AYSLDKSYFHFVHEFVTILMDSAILFFGILPWFWKMSGAVLPRLGLDPENEILHTLSFLA 

15 BASF_AT1 AYSLDKSYFHFVHEFVTILMDSAILFFGILPWFWKMSGAVLPRLGLDPENEILHTLSFLA 

PPI-AtCPP AYSLDKSYFHFVHEFVTILMDSAILFFGILPWFWKMSGAVLPRLGLDPENEILHTLSFLA 

PPI-BnCPP AYSLDKSHFHFVHEFVTILMDSAILFFGILPWFWKISGGFLPMVGLDPENEILHTLSFLA 

BASF-Corn TRLSAENE I IHTLAFLA 



20 



25 



30 



PPI-GmCPP GLMIWSQITDLPFSLYSTFVIEARHGFNKQT PWLFFRDMLKGIFLSVIIGPPIVAAIIVI 

BASF-Gm GLMIWSQITDLPFSLYSTFVIEARHGFNKQT PWLFFRDMLKGIFLSVIIGPPIVAAIIVI 

AF0072 69 TDLPFSLYSTFVIESRHGFNKQTIWMFIRDMIKGTFLSVILGPPIVAAIIFI 

AT4g-AtCPP GVMTWSQITDLPFSLYSTFVIESRHGFNKQTIWMFIRDMIKGTFLSVILGPPIVAAIIFI 

BASF_AT2 GVMTWSQITDLPFSLYSTFVIESRHGFNKQTIWMFIRDMIKGTFLSVILGPPIVAAIIFI 

AFC1 GVMTWSQITDLPFSLYSTFVIESRHGFNKQTIWMFIRDMIKGTFLSVILGPPIVAAIIFI 

BASF_AT1 GVMTWSQITDLPFSLYSTFVIESRHGFNKQTIWMFIRDMIKGTFLSVILGPPIVAAIIFI 

PPI-AtCPP GVMTWSQITDLPFSLYSTFVIESRHGFNKQTIWMFIRDMIKGTFLSVILGPPIVAAIIFI 

PPI-BnCPP GLMTWSQITDLPFSLYSTFVIESRHGFNKQTIWMFIRDMIKGILLSVIPAPPIVAAIIVI 

BASF-Corn GSMVWSQITDLPFSLYSTFVIEARHGFNKQTIWLFIRDMIKGILLSMILGPPIVAAIIYI 

**************.******* * *.*.***.** .**.* ******** * 



PPI-GmCPP VQKGGPYLAIYLWVFTFGLSIVMMTLYPVLIAPLFNKFTPLPDGQLREKIEKLASSLNYP 

BASF-Gm VQKGGPYLAIYLWVFTFGLSIVMMTLYPVLIAPLFNKFTPLPDGQLREKIEKLASSLNYP 

35 AF0072 69 VQKGGPYLAIYLWAFMFILSLVMMTIYPVLIAPLFNKFTPLPDGDLREKIEKLASSLKFP 

AT4g-AtCPP VQKGGPYLAIYLWAFMFILSLVMMTIYPVLIAPLFNKFTPLPDGDLREKIEKLASSLKFP 

BASF_AT2 VQKGGPYLAIYLWAFMFILSLVMMTIYPVLIAPLFNKFTPLPDGDLREKIEKLASSLKFP 

AFC1 VQKGGPYLAIYLWAFMFILSLVMMTIYPVLIAPLFNKFTPLPDGDLREKIEKLASSLKFP 

BASF_AT1 VQKGGPYLAIYLWAFMFILSLVMMTIYPVLIAPLFNKFTPLPDGDLREKIEKLASSLKFP 

40 PPI-AtCPP VQKGGPYLAIYLWAFMFILSLVMMTIYPVLIAPLFNKFTPLPDGDLREKIEKLASSLKFP 

PPI-BnCPP VQKGGPYLAIYLWAFMFILSLVMMTIYPVLIAPLFNKFTPLPDGDLREKIEKLASSLKFP 

BASF-Corn VQIGGPYLAIYLWGFMFVLALLMMTIYPIVIAPLFNKFTPLPEGVLREKIEKLAASLKFP 
* * ********** * * *...***.**..**********★*.★ *★★******.**..* 

45 

PPI-GmCPP LKKLFWDGSTRSSHSNAYMYGFFKNKRIVPYDTLIQQCKDDEEIVAVIAHELGHWKLNH 

BASF-Gm LKKLFVVDGSTRSSHSNAYMYGFFKNKRIVLYDTLIQQCKDDEEIVAVIAHELGHWKLNH 

AF0072 69 LKKLFVVDGSTRSSHSNAYMYGFFKNKRIVLYDTLIQQCKNEDEIVAVIAHELGHWKLNH 

50 AT4g-AtCPP LKKLFVVDGSTRSSHSNAYMYGFFKNKRIVLYDTLIQQCKNEDEIVAVIAHELGHWKLNH 

BASF_AT2 LKKLFWDGSTRSSHSNAYMYGFFKNKRIVLYDTLIQQCKNEDEIVAVIAHELGHWKLNH 

AFC1 LKKLFVVDGSTRSSHSNAYMYGFFKNKRIVLYDTLIQQCKNEDEIVAVIAHELGHWKLNH 

BAS F_AT 1 LKKLFWDGSTRSSHSNAYMYGFFKNKRIVLYDTLIQQCKNEDEIVAVIAHELGHWKLNH 

PPI-AtCPP LKKLFVVDGSTRSSHSNAYMYGFFKNKRIVLYDTLIQQCKNEDEIVAVIAHELGHWKLNH 

55 PPI-BnCPP LKKLFVVDGSTRSSHSNAYMYGFFKNKRIVLYDTLIQQCQNENEIVAVIAHELGHWKLNH 

BASF-Corn LKKLFWDGSTRSSHSNAYMYGFFKNKRIVLYDTLIQQCSNEDEIVSVIAHELGHWKLNH 
****************************** ******** ...***.************* 



PPI-GmCPP TVYTFVAMQILTLLQFGGYTLVRNSADLYRSFGFDTQPVLIGLII FQHTVIPLQQLVSFG 

60 BASF-Gm TVYTFVAMQILTLLQFGGYTLVRNSADLYRSFGFDTQPVLIGLII FQHTVIPLQQLVSFG 

AF0072 69 TTYSFIAV QHTVI PLQHLVSFG 

AT4g-AtCPP TTYS FIAVQILAFLQFGGYTLVRNSTDLFRSFGFDTQPVLIGLI I FQHTVI PLQHLVSFG 

BASF_AT2 TTYSFIAVQILAFLQFGGYTLVRNSTDLFRSFGFDTQPVLIGLI I FQHTVI PLQHLVSFG 

AFC1 TTYS FI AVQI LAFLQFGGYTLVRNSTDLFRS FGFDTQPVL IGLI I FQHTVI PLQHLVS FG 

65 BAS F_AT 1 TTYSFIAVQILAFLQFGGYTLVRNSTDLFRSFGFDTQPVLIGLI I FQHTVI PLQHPVSFG 

PPI-AtCPP TTYSFIAVQILAFLQFGGYTLLRNSTDLFRSFGFDTQPVLIGLI I FQHTVI PLQHLVSFG 

PPI-BnCPP TTYSFIAVQILAFLQFGGYTLVRNSTDLFRSFGFDTQPVLIGLI I FQHTVI PLQHLVSFD 

BASF-Corn TVYSFVAVQLLMFLQFGGYTLVRSSKDLFGSFGFKDQPVIIGLII FPHTIIPIQHLLSFR 
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10 



PPI-GmCPP 

BASF-Gm 

AF007269 

AT4g-AtCPP 

BAS F__AT 2 

AFC1 

BASF_AT1 

PPI-AtCPP 

PPI-BnCPP 

BASF-Corn 



LNLVSRSFEFQADGFAKKLGYASGLRG- 
LNLVS R S FE FQADGFAKKLG Y AS GLRG - 
LNLVSRAFEFQADAFAVKLGYAKDLR— 
LNLVSRAFEFQADAFAVKLGYAKDLR-- 
LNLVSRAFE FQADAFAVKLG YAKDLR- - 
LNLVSRAFE FQADAFAVKLGYAKDLR- - 



-PALV KLQVREDNNRTQ 

-PALV KLQVRE DNNRTQT VT S I C V 

-PALV KLQE 

- PALVKLQE 



LNLVSRAFE FQADAFAVKLGYAKDLRPTLVKLQ- 
LNLVSRAFE FQADAFAVKLDYAKDLRPALVKLQ- 

LNLVSRAFE FQADAFAVNLGYAKDLRP 

LNLVSRAFE FQADAFAKNLGYAPQLR 



1 5 PPI-GmCPP GLVKLQEENLSAMNTDPWYSAYHYSHPPLVERLAALDEPDKKED- 

BASF-Gm GLVKLQEENLSAMNTDPCSC 

AF0072 69 TEENLSAMNTDPLYSAYHYSHPPLVERLRAIDGEDKKTD- 

AT4g-AtCPP THLNGFFVGILQEENLSAMNTDPLYSAYHYSHPPLVERLRAIDGEDKKTD- 

BASF_AT2 ENLSAMNTDPLYSAYHYSHPPLVERLRAIDGEDKKTD- 

20 AFC1 ENLSAMNTDPLHSAYHYSHPPLVERLRAIDGEDKKTD- 

BAS F_AT 1 EENLSAMNTDPLYSAYHYSHPPLVERLRAIDGEDKKTD- 

PPI-AtCPP EENLSTMNTDPLYSAYHYSHPPLVERLRATDGEDKKTD- 

PPI-BnCPP ALVKLQEENLSAMNTDPLYSAYHYSHPPLVERLRAIDGEDKKTD- 

BASF-Corn AALVKLQEENLSAMNTDPWYSAYHYSHPPLVERLQALEDSDDKKED 

25 



Example 4: Plant Transformation 

30 Arabidopsis transgenic plants were made by the method of dipping flowering 

plants into an Agrobacterium culture, based on the method of Andrew Bent in, Clough 
SJ and Bent AF, 1998. Floral dipping: a simplified method for Agrobacterium-mediatzd 
transformation of Arabidopsis thaliana. Wild type plants were grown under standard 
conditions until the plant has both developing flowers and open flowers. The plant was 

35 inverted for 2 minutes into a solution of Agrobacterium culture carrying the appropriate 
gene construct. Plants were then left horizontal in a tray and kept covered for two days to 
maintain humidity and then righted and bagged to continue growth and seed 
development. Mature seed was bulk harvested. 

Transformed Tl plants were selected by germination and growth on MS plates 
40 containing 50 |ag/ml kanamycin. Green, kanamycin resistant (Kan R ) seedlings were 
identified after 2 weeks growth and transplanted to soil. Plants were bagged to ensure 
self fertilization and the T2 seed of each plant harvested separately. During growth of Tl 
plants leaf samples were harvested, DNA extracted and Southern blot and PCR analysis 
performed. 
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T2 seeds were analysed for Kan R segregation. From those lines that showed a 3:1 
resistant phenotype, surviving T2 plants were grown, bagged during seed set, and T3 
seed harvested from each line. T3 seed was again used for Kan R segregation analysis and 
those lines showing 100% Kan R phenotype were selected as homozygous lines. Further 
5 molecular and physiological analysis was done using T3 seedlings. 

Transgenic Brassica napus, Glycine max and Zea maize plants were produced 
using Agrobacterium mediated transformation of cotyledon petiole tissue. Seeds were 
sterilized as follows. Seeds were wetted with 95% ethanol for a short period of time such 
as 15 seconds. Approximately 30 ml of sterilizing solution I was added (70% Javex , 

10 lOOfil Tween20) and left for approximately 15 minutes. Solution I was removed and 
replaced with 30 ml of solution II (0.25% mecuric chloride, 100|al Tween20) and 
incubated for about 10 minutes. Seeds were rinsed with at least 500 ml double distilled 
sterile water and stored in a sterile dish. Seeds were germinated on plates of V2 MS 
medium, pH 5.8, supplemented with 1% sucrose and 0.7% agar. Fully expanded 

1 5 cotyledons were harvested and placed on Medium I (Murashige minimal organics 
(MMO), 3% sucrose, 4.5 mg/L benzyl adenine (BA), 0.7% phytoagar, pH5.8). An 
Agrobacterium culture containing the nucleic acid construct of interest was grown for 2 
days in AB Minimal media. The cotyledon explants were dipped such that only the cut 
portion of the petiole is contacted by the Agrobacterium solution. The explants were then 

20 embedded in Medium I and maintained for 5 days at 24°C, with 16,8 hr light dark cycles. 

Explants were transferred to Medium II (Medium I, 300 mg/L timentin,) for a 
further 7 days and then to Medium III (Medium II, 20 mg/L kanamycin). Any root or 
shoot tissue which had developed at this time was dissected away. Transfer explants to 
fresh plates of Medium III after 14-21 days. When regenerated shoot tissue developed 

25 the regenerated tissue was transferred to Medium IV (MMO, 3% sucrose, 1.0% 

phytoagar, 300 mg/L timentin, 20 mg/L 20 mg/L kanamycin). Once healthy shoot tissue 
developed shoot tissue dissected from any callus tissue was dipped in 10X IB A and 
transferred to Medium V (Murashige and Skooge (MS), 3% sucrose, 0.2 mg/L indole 
butyric acid (IB A), 0.7% agar, 300 mg/L timentin, 20 mg/L 20 mg/L kanamycin) for 

30 rooting. Healthy plantlets were transferred to soil. The above method, with or without 
modifications, is suitable for the transformation of numerous plant species including 
Glycine max, Zea maize and cotton. 
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Transgenic Glycine max, Zea maize and cotton can be produced using 
Agrobacterium-based methods which are known to one of skill in the art. Alternatively 
one can use a particle or non-particle biolistic bombardment transformation method. An 
example of non-particle biolistic transformation is given in U.S. Patent Application 

5 20010026941 . This method has been used to produce transgenic Glycine max and Zea 
maize plants. Viable plants are propagated and homozygous lines are generated. Plants 
are tested for the presence of drought tolerance, physiological and biochemical 
phenotypes as described elsewhere. 

The following table identifies the constructs and the species which they have 

1 0 been transformed. 

Table 7 Transformation List 



SEP ID NO: Construct Species Transformed 

4 pBII121-AtCPP A. thaliana, B. napus 

5 P BII121-HP-AtCPP A. thaliana 

15 36 pRD29A-AtCPP A. thaliana, B. napus 

37 P RD29A-HP-AtCPP A. thaliana 

39 MuA-AtCPP Glycine max, Zea mays 



Non-limiting examples of vector constructs suitable for plant transformation are given in 
20 SEQ ID NO: 4, 5, 35-53. 

SEQ ID NO:4 

gtttacccgccaa ta ta tec tgtcaaacactgatagtttaaactgaaggcgggaaacga 
caatctgatcatgagcggagaattaagggagtcacgttatgacccccgccgatgacgcg 
ggacaagccgttttacgtttggaactgacagaaccgcaacgttgaaggagccactcagc 

25 cgcgggtttctggagtttaatgagctaagcacatacgtcagaaaccattattgcgcgtt 
caaaagtcgcctaaggtcactatcagctagcaaatatttcttgtcaaaaatgctccact 
gacgttccataaattcccctcggtatccaattagagtctcatattcactctcaatccaa 
ataatctgcaccggatctggatcgtttcgcatgattgaacaagatggattgcacgcagg 
ttctccggccgcttgggtggagaggctattcggctatgactgggcacaacagacaatcg 

30 gctgctctgatgccgccgtgttccggctgtcagcgcaggggcgcccggttctttttgtc 
aagaccgacctgtccggtgccctgaatgaactgcaggacgaggcagcgcggctatcgtg 
gctggccacgacgggcgttccttgcgcagctgtgctcgacgttgtcactgaagcgggaa 
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gggactggctgctattgggcgaagtgccggggcaggatctcctgtcatctcaccttgct 
cctgccgagaaagtatccatcatggctgatgcaatgcggcggctgcatacgcttgatcc 
ggctacctgcccattcgaccaccaagcgaaacatcgcatcgagcgagcacgtactcgga 
tggaagccggtcttgtcgatcaggatgatctggacgaagagcatcaggggctcgcgcca 
gccgaactgttcgccaggctcaaggcgcgcatgcccgacggcgatgatctcgtcgtgac 
ccatggcgatgcctgcttgccgaatatcatggtggaaaatggccgcttttctggattca 
tcgactgtggccggctgggtgtggcggaccgctatcaggacatagcgttggctacccgt 
gatattgctgaagagcttggcggcgaatgggctgaccgcttcctcgtgctttacggtat 
cgccgctcccgattcgcagcgcatcgccttctatcgccttcttgacgagttcttctgag 
cgggactctggggttcgaaatgaccgaccaagcgacgcccaacctgccatcacgagatt 
tcgattccaccgccgccttctatgaaaggttgggcttcggaatcgttttccgggacgcc 
ggctggatgatcctccagcgcggggatctcatgctggagttcttcgcccacgggatctc 
tgcggaacaggcggtcgaaggtgccgatatcattacgacagcaacggccgacaagcaca 
acgccacgatcctgagcgacaatatgatcgggcccggcgtccacatcaacggcgtcggc 
ggcgactgcccaggcaagaccgagatgcaccgcgatatcttgctgcgttcggatatttt 
cgtggagttcccgccacagacccggatgatccccgatcgttcaaacatttggcaataaa 
gtttcttaagattgaatcctgttgccggtcttgcgatgattatcatataatttctgttg 
aattacgttaagcatgtaataattaacatgtaatgcatgacgttatttatgagatgggt 
ttttatgattagagtcccgcaattatacatttaatacgcgatagaaaacaaaatatagc 
gcgcaaactaggataaattatcgcgcgcggtgtcatctatgttactagatcgggcctcc 
tgtcaatgctggcggcggctctggtggtggttctggtggcggctctgagggtggtggct 
ctgagggtggcggttctgagggtggcggctctgagggaggcggttccggtggtggctct 
ggttccggtgattttgattatgaaaagatggcaaacgctaataagggggctatgaccga 
aaatgccgatgaaaacgcgctacagtctgacgctaaaggcaaacttgattctgtcgcta 
ctgattacggtgctgctatcgatggtttcattggtgacgtttccggccttgctaatggt 
aatggtgctactggtgattttgctggctctaattcccaaatggctcaagtcggtgacgg 
tgataattcacctttaatgaataatttccgtcaatatttaccttccctccctcaatcgg 
ttgaatgtcgcccttttgtctttggcccaatacgcaaaccgcctctccccgcgcgttgg 
ccgattcattaatgcagctggcacgacaggtttcccgactggaaagcgggcagtgagcg 
caacgcaattaatgtgagttagctcactcattaggcaccccaggctttacactttatgc 
ttccggctcgtatgttgtgtggaattgtgagcggataacaatttcacacaggaaacagc 
tatgaccatgattacgccaagcttgcatgcctgcag cccacagatggttagagaggctt 
acgcagcaggtctcatcaagacgatctacccgagcaataatctccaggaaatcaaatac 
cttcccaagaaggttaaagatgcagtcaaaagattcaggactaactgcatcaagaacac 
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agagaaagatatatttctcaagatcagaagtactattccagtatggacgattcaaggct 
tgcttcacaaaccaaggcaagtaatagagattggagtctctaaaaaggtagttcccact 
gaatcaaaggccatggagtcaaagattcaaatagaggacctaacagaactcgccgtaaa 
gactggcgaacagttcatacagagtctcttacgactcaatgacaagaagaaaatcttcg 
tcaacatggtggagcacgacacacttgtctactccaaaaatatcaaagatacagtctca 
gaagaccaaagggcaattgagacttttcaacaaagggtaatatccggaaacctcctcgg 
attccattgcccagctatctgtcactttattgtgaagatagtggaaaaggaaggtggct 
cctacaaatgccatcattgcgataaaggaaaggccatcgttgaagatgcctctgccgac 
agtggtcccaaagatggacccccacccacgaggagcatcgtggaaaaagaagacgttcc 
aaccacgtcttcaaagcaagtggattgatgtgatatctccactgacgtaagggatgacg 
cacaatcccactatccttcgcaagacccttcctctatataaggaagttcatttcatttg 
gagagaacacgggg gactctagaggatcca'bggcga'b-bccii'b'bcatggaaaccgtcg'bg 
ggttttatgatagtgatgtacatttttgagacgtatttggatctgaggcaactcactgc 
tctcaagcttccaactctcccgaaaaccttggttggtgtaattagccaagagaagtttg 
agaaatcacgagcatacagtcttgacaaaagctattttcactttgttcatgagtttgta 
actatacttatggactctgcaattttgttctttgggatcttgccttggttttggaagat 
gtctggagctgttttaccgaggttgggccttgatccggagaatgaaatactgcatactc 
tttcattcttggctggtgttatgacatggtcacagat^ 

tactcaactttcgtgatcgagtctcggcatgggttcaacaaacaaacaatatggatgtt 
cattagggacatgatcaaaggaacattcctctctgtcatactaggcccacccattgttg 
ctgcgataattttcatagtccagaaaggaggtccttatcttgccatctatctgtgggca 
ttcatgtttatcctgtctctagtgatgatgactatatacccggtcttgatagcaccgct 
cttcaacaaattcactcctcttccagatggagacctccgggagaagattgagaaacttg 
cttcttccctaaagtttcctttgaagaagctgtttgttgtcgatggatctacaaggtca 
agccatagcaatgcttacatgtatggtttctttaagaacaaaaggattgttctttatga 
tacgttgattcagcagtgcaagaatgaggatgaaattgtggcggttattgcacacgagc 
ttggacattggaaactgaatcacactacatactcgttcattgcagttcaaatccttgcc 
ttcttacaatttggaggatacactcttctcagaaactccactgatctcttcaggagttt 
cggatttgatacacagcctgttctcattggtttgatcatatttcagcacactgtaatac 
cactgcaacatctagtaagctttggcctgaacctcgttagtcgagcg-tttgagtttcag 
gctgatgcttttgctgtgaagcttgactatgcaaaagatcttcgtcctgctctagtgaa 
actacaggaagagaacttatcaacaatgaacactgatccattgtactcagcttatcact 
actcacatcctcctcttgttgaaaggcttcgagccactgatggagaagacaagaagaca 
gattaacccctcgaatttccccgatcgttcaaacatttggcaataaagtttcttaagat 
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tgaatcctgttgccggtcttgcgatgattatcatataatttctgttgaattacgttaag 
catgtaataattaacatgtaatgcatgacgttatttatgagatgggtttttatgattag 
agtcccgcaattatacatttaatacgcgatagaaaacaaaatatagcgcgcaaactagg 
ataaattatcgcgcgcggtgtcatctatgttactagatcgggaattcactggccgtcgt 

5 tttacaacgtcgtgactgggaaaaccctggcgttacccaacttaatcgccttgcagcac 
atccccctttcgccagctggcgtaatagcgaagaggcccgcaccgatcgcccttcccaa 
cagttgcgcagcctgaatggcgcccgctcctttcgctttcttcccttcctttctcgcca 
cgttcgccggctttccccgtcaagctctaaatcgggggctccctttagggttccgattt 
agtgctttacggcacctcgaccccaaaaaacttgatttgggtgatggttcacgtagtgg 

10 gccatcgccctgatagacggtttttcgccctttgacgttggagtccacgttctttaata 
gtggactcttgttccaaactggaacaacactcaaccctatctcgggctattcttttgat 
ttataagggattttgccgatttcggaaccaccatcaaacaggattttcgcctgctgggg 
caaaccagcgtggaccgcttgctgcaactctctcagggccaggcggtgaagggcaatca 
gctgttgcccgtctcactggtgaaaagaaaaaccaccccagtacattaaaaacgtccgc 

15 aatgtgttattaagttgtctaagcgtcaatttgtttacaccacaa ta tatcctgcca 



SEQ ID NO:4 is the nucleic acid sequence of pBI121-AtCPP. Italicized 
sequences are the right and left border repeats. Underlined sequence is the 35S promoter 
and bolded sequence is the AtCPP sense sequence. 

20 SEQ ID NO:5 

gt t ta cccgcca a tatatcctgt caaacactgatagtttaaactgaaggcgggaaacga 
caatctgatcatgagcggagaattaagggagtcacgttatgacccccgccgatgacgcg 
ggacaagccgttttacgtttggaactgacagaaccgcaacgttgaaggagccactcagc 
cgcgggtttctggagtttaatgagctaagcacatacgtcagaaaccattattgcgcgtt 

25 caaaagtcgcctaaggtcactatcagctagcaaatatttcttgtcaaaaatgctccact 
gacgttccataaattcccctcggtatccaattagagtctcatattcactctcaatccaa 
ataatctgcaccggatctggatcgtttcgcatgattgaacaagatggattgcacgcagg 
ttctccggccgcttgggtggagaggctattcggctatgactgggcacaacagacaatcg 
gctgctctgatgccgccgtgttccggctgtcagcgcaggggcgcccggttctttttgtc 

30 aagaccgacctgtccggtgccctgaatgaactgcaggacgaggcagcgcggctatcgtg 
gctggccacgacgggcgttccttgcgcagctgtgctcgacgttgtcactgaagcgggaa 
gggactggctgctattgggcgaagtgccggggcaggatctcctgtcatctcaccttgct 
cctgccgagaaagtatccatcatggctgatgcaatgcggcggctgcatacgcttgatcc 
ggctacctgcccattcgaccaccaagcgaaacatcgcatcgagcgagcacgtactcgga 

35 tggaagccggtcttgtcgatcaggatgatctggacgaagagcatcaggggctcgcgcca 
gccgaactgttcgccaggctcaaggcgcgcatgcccgacggcgatgatctcgtcgtgac 
ccatggcgatgcctgcttgccgaatatcatggtggaaaatggccgcttttctggattca 
tcgactgtggccggctgggtgtggcggaccgctatcaggacatagcgttggctacccgt 
gatattgctgaagagcttggcggcgaatgggctgaccgcttcctcgtgctttacggtat 

40 cgccgctcccgattcgcagcgcatcgccttctatcgccttcttgacgagttcttctgag 
cgggactctggggttcgaaatgaccgaccaagcgacgcccaacctgccatcacgagatt 
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tcgattccaccgccgccttctatgaaaggttgggcttcggaatcgttttccgggacgcc 
ggct ggat gat cctccagcgcggggatc teat get ggagttcttcgcccacgggat etc 
tgcggaacaggcggtcgaaggtgccgatatcattacgacagcaacggccgacaagcaca 
acgccacgatcctgagcgacaatatgatcgggcccggcgtccacatcaacggcgtcggc 
5 ggcgactgcccaggcaagaccgagatgcaccgcgatatcttgctgcgttcggatatttt 
cgtggagttcccgccacagacccggatgatccccgatcgttcaaacatttggcaataaa 
gtttcttaagattgaatcctgttgccggtcttgcgatgattatcatataatttctgttg 
aattacgttaagcatgtaataattaacatgtaatgcatgacgttatttatgagatgggt 
ttttatgattagagtcccgcaattatacatttaatacgcgatagaaaacaaaatatagc 

10 gcgcaaactaggataaattatcgcgcgcggtgtcatctatgttactagatcgggcctcc 
tgtcaat get ggcggcggctctggtggtggttctggtggcggctctgagggtggt ggct 
ctgagggtggcggttctgagggtggcggctctgagggaggcggttccggtggtggctct 
ggttccggtgattttgattatgaaaagatggcaaacgctaataagggggctatgaccga 
aaatgccgatgaaaacgcgctacagtctgacgctaaaggcaaacttgattctgtcgcta 

15 ctgattacggtgctgctatcgatggtttcattggtgacgtttccggccttgctaatggt 
aatggt get actggt gat tttget ggct ctaattcccaaat ggct caagteggtgaegg 
tgataattcacctttaatgaataatttccgtcaatatttaccttccctccctcaatcgg 
ttgaatgtcgcccttttgtctttggcccaatacgcaaaccgcctctccccgcgcgttgg 
ccgattcattaatgcagctggcacgacaggtttcccgactggaaagcgggcagtgagcg 

20 caacgcaattaatgtgagttagctcactcattaggcaccccaggctttacactttatgc 
ttccggctcgtatgttgtgtggaattgtgagcggataacaatttcacacaggaaacagc 
tatgaccatgattacgccaagcttgcatgcctgcag cccacagatggttagagaggctt 
acgcagcaggtctcatcaagacgatctacccgagcaataatctccaggaaatcaaatac 
cttcccaagaaggttaaagatgcagtcaaaagattcaggactaactgcatcaagaacac 

25 agagaaagatatatttctcaagatcagaagtactattccagtatggacgattcaaggct 
tgcttcacaaaccaaggcaagtaatagagattggagtctctaaaaaggtagttcccact 
gaatcaaaggccatggagtcaaagattcaaatagaggacctaacagaactcgccgtaaa 
gactggcgaacagttcatacagagtctcttacgactcaatgacaagaagaaaatcttcg 
tcaacatggtggagcacgacacacttgtctactccaaaaatatcaaagatacagtctca 

30 gaagaccaaagggcaattgagacttttcaacaaagggtaatatccggaaacctcctcgg 
attccattgcccagctatctgtcactttattgtgaagatagtggaaaaggaaggtggct 
cctacaaatgccateattgcgataaaggaaaggccatcgttgaagatgcctctgccgac 
agtggtcccaaagatggacccccacccacgaggagcatcgtggaaaaagaagacgttcc 
aaccacgtcttcaaagcaagtggattgatgtgatatctccactgacgtaagggatqacg 

35 cacaatcccactatecttcgcaagacccttcctctatataaggaagttcatttcatttg 
gagagaacacgggg gactctagaggatcctcccaatgtccaagctcgtgtgcaataacc 
gccacaatttcatcctcattcttgcactgctgaatcaacgtatcataaagaacaatcct 
tttgttcttaaagaaaccatacatgtaagcattgctatggcttgaccttgtagatccat 
cgacaacaaacagcttcttcaaaggaaactttagggaagaagcaagtttctcaatcttc 

40 tcccggaggtctccatctggaagaggagtgaatttgttgaagagcggtgctatcaagac 
cgggtatatagtcatcatcactagagacaggataaacatgaatgcccacagatagatgg 
caagataaggacctcctttctggactatgaaaattatcgcagcaacaatgggtgggcct 
agtatgacagagaggaatgttcctttgatcatgtccctaatgaacatccatattgtttg 
tttgttgaacccatgccgagactcgatcacgaaagttgagtacaaagaaaatggcaaat 

45 cagtgatctgtgaccatgtcataacaccagccaagaatgaaagagtatgcagtatttca 
ttctccggatcaaggcccaacctcggtaaaagaggatccccATCTACCCGCTTCGCGTC 
GGCATCCGGTCAGTGGCAGTGAAGGGCGAACAGTTCCTGATTAACCACAAACCGTTCTA 
CTTTACTGGCTTTGGTCGTCATGAAGATGCGGACTTGCGTGGCAAAGGATTCGATAACG 
TGCTGATGGTGCACGACCACGCATTAATGGACTGGATTGGGGCCAACTCCTACCGTACC 

50 TCGCATTACCCTTACGCTGAAGAGATGCTCGACTGGGCAGATGAACATGGCATCGTGGT 
GATTGATGAAACTGCTGCTGTCGGCTTTTCGCTCTCTTTAGGCATTGGTTTCGAAGCGG 
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GCAACAAGCCGAAAGAACTGTACAGCGAAGAGGCAGTCAACGGGGAAACTCAGCAAGCG 
CACTTACAGGCGATTAAAGAGCTGATAGCGCGTGACAAAAACCACCCAAGCGTGGTGAT 
GTGGAGTATTGCCAACGAACCGGATACCCGTCCGCAAGGTGCACGGGAATATTTCGCGC 
CACTGGCGGAAGCAACGCGTAAACTCGACCCGACGCGTCCGATCACCTGCGTCAATGTA 
5 ATGTTCTGCGACGCTCACACCGATACCATCAGCGATCTCTTTGATGTGCTGTGCCTGAA 
CCGTTATTACGGATGGTATGTCCAAAGCGGCGATTTGGAAACGGCAGAGAAGGTACTGG 
AAAAAGAACTTCTGGCCTGGCAGGAGAAACTGTACACCGACATGTGGAGTGAAGAGTAT 
CAGTGTGCATGGCTGGATATGTATCACCGCGTCTTTGATCGCGTCAGCGCCGTCGTCGG 
TGAACAGGTATGGAATTTCGCCGATTTTGCGACCTCGCAAGGCATATTGCGCGTTGGCG 

10 GTAACAAGAAAGGGATCTTCACTCGCGACCGCAAACCGAAGTCGGCGGCTTTTCTGCTG 
CAAAAACGCTGGACTGGCATGAACTTCGGTGAAAAACCGCAGCAGGGAGGCAAACAATG 
AATCAACAACTCTCCTGGCGCACCATCGTCGGCTACAGCCTCGGGAATTGCTACCGAGC 
TC t:tt:t:accgaggttgggccttgatccggagaa-bgaaatac-bgcat:actct:t:^cat:t:c-b 
-bggctgg-bg-btatgacatgg'bcacagatcac'bgat-btgcca-bt'btctttgtac^caact 

15 ttcgtgatcgagtctcggcatgggttcaacaaacaaacaatatggatgttcattaggga 
catgatcaaaggaacattcctctctgtcatactaggcccacccattgttgctgcgataa 
ttttca-bagt:ccagaaaggaggt:cctt:at:ct:i:gccatct:at:ci:gt:gggca-btcatgt-bt: 
at:cc < bgtctctagtgatgat:gact:atat:acccgg < bct:i:gatagcaccgct:c-bi:caacaa 
a-btcac , bcct:ct:'bccagat:ggagacc < bccgggagaaga , bt:gagaaacttgc-bt:ct:t:ccc 

20 taaagt-b-bcctttgaagaagctg'bttgt-bg-bcga-bggatctacaaggtcaagccatagc 
aatgcttacatgtatggtttctiitaagaacaaaaggattgt-bc'b'b-ba'bgatacgt'bgat 
tcagcagtgcaagaatgaggatgaaa-ttgtggcggt'battgcacacgagctitggaca'b-b 
ggqagctcgaatttccccgatcgttcaaacatttggcaataaagtttcttaagattgaa 
tcctgttgccggtcttgcgatgattatcatataatttctgttgaattacgttaagcatg 

25 taataattaacatgtaatgcatgacgttatttatgagatgggtttttatgattagagtc 
ccgcaattatacatttaatacgcgatagaaaacaaaatatagcgcgcaaactaggataa 
attatcgcgcgcggtgtcatctatgttactagatcgggaattcactggccgtcgtttta 
caacgtcgtgactgggaaaaccctggcgttacccaacttaatcgccttgcagcacatcc 
ccctttcgccagctggcgtaatagcgaagaggcccgcaccgatcgcccttcccaacagt 

30 tgcgcagcctgaatggcgcccgctcctttcgctttcttcccttcctttctcgccacgtt 
cgccggctttccccgtcaagctctaaatcgggggctccctttagggttccgatttagtg 
ctttacggcacctcgaccccaaaaaacttgatttgggtgatggttcacgtagtgggcca 
tcgccctgatagacggtttttcgccctttgacgttggagtccacgttctttaatagtgg 
actcttgttccaaactggaacaacactcaaccctatctcgggctattcttttgatttat 

35 aagggattttgccgatttcggaaccaccatcaaacaggattttcgcctgctggggcaaa 
ccagcgtggaccgcttgctgcaactctctcagggccaggcggtgaagggcaatcagctg 
ttgcccgtctcactggtgaaaagaaaaaccaccccagtacattaaaaacgtccgcaatg 
t gttatt aagt t gt ct aagcgtcaatt tgttta caeca caatatatcctgcca 

40 SEQ ID NO:5 is the nucleic acid sequence of pBI121-HP-AtCPP. Italicized 

sequences are the right and left border repeats. Underlined sequence is the 35S promoter 
and bolded sequence is the AtCPP anti-sense sequence. Sequence in upper case is the 
truncated GUS fragment. Sequence in bold and underlined is the AtCPP sense sequence. 

SEQ ID NO:35 

45 gtttacccgccaa ta ta tcctgtcaaacactgatagtttaaactgaaggcgggaaacga 

caatctgatcatgagcggagaattaagggagtcacgttatgacccccgccgatgacgcg 

ggacaagccgttttacgtttggaactgacagaaccgcaacgttgaaggagccactcagc 
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cgcgggtttctggagtttaatgagctaagcacatacgtcagaaaccattattgcgcgtt 
caaaagtcgcctaaggtcactatcagctagcaaatatttcttgtcaaaaatgctccact 
gacgttccataaattcccctcggtatccaattagagtctcatattcactctcaatccaa 
ataatctgcaccggatctggatcgtttcgcatgattgaacaagatggattgcacgcagg 
5 ttctccggccgcttgggtggagaggctattcggctatgactgggcacaacagacaatcg 
gctgctctgatgccgccgtgttccggctgtcagcgcaggggcgcccggttctttttgtc 
aagaccgacctgtccggtgccctgaatgaactgcaggacgaggcagcgcggctatcgtg 
gctggccacgacgggcgttccttgcgcagctgtgctcgacgttgtcactgaagcgggaa 
gggactggctgctattgggcgaagtgccggggcaggatctcctgtcatctcaccttgct 

10 cctgccgagaaagtatccatcatggctgatgcaatgcggcggctgcatacgcttgatcc 
ggctacctgcccattcgaccaccaagcgaaacatcgcatcgagcgagcacgtactcgga 
tggaagccggtcttgtcgatcaggatgatctggacgaagagcatcaggggctcgcgcca 
gccgaactgttcgccaggctcaaggcgcgcatgcccgacggcgatgatctcgtcgtgac 
ccatggcgatgcctgcttgccgaatatcatggtggaaaatggccgcttttctggattca 

15 tcgactgtggccggctgggtgtggcggaccgctatcaggacatagcgttggctacccgt 
gatattgctgaagagcttggcggcgaatgggctgaccgcttcctcgtgctttacggtat 
cgccgctcccgattcgcagcgcatcgccttctatcgccttcttgacgagttcttctgag 
cgggactctggggttcgaaatgaccgaccaagcgacgcccaacctgccatcacgagatt 
tcgattccaccgccgccttctatgaaaggttgggcttcggaatcgttttccgggacgcc 

20 ggctggatgatcctccagcgcggggatctcatgctggagttcttcgcccacgggatctc 
tgcggaacaggcggtcgaaggtgccgatatcattacgacagcaacggccgacaagcaca 
acgccacgatcctgagcgacaatatgatcgggcccggcgtccacatcaacggcgtcggc 
ggcgactgcccaggcaagaccgagatgcaccgcgatatcttgctgcgttcggatatttt 
cgtggagttcccgccacagacccggatgatccccgatcgttcaaacatttggcaataaa 

25 gtttcttaagattgaatcctgttgccggtcttgcgatgattatcatataatttctgttg 
aattacgttaagcatgtaataattaacatgtaatgcatgacgttatttatgagatgggt 
ttttatgattagagtcccgcaattatacatttaatacgcgatagaaaacaaaatatagc 
gcgcaaactaggataaattatcgcgcgcggtgtcatctatgttactagatcgggcctcc 
tgtcaatgctggcggcggctctggtggtggttctggtggcggctctgagggtggtggct 

30 ctgagggtggcggttctgagggtggcggctctgagggaggcggttccggtggtggctct 
ggttccggtgattttgattatgaaaagatggcaaacgctaataagggggctatgaccga 
aaatgccgatgaaaacgcgctacagtctgacgctaaaggcaaacttgattctgtcgcta 
ctgattacggtgctgctatcgatggtttcattggtgacgtttccggccttgctaatggt 
aatggtgctactggtgattttgctggctctaattcccaaatggctcaagtcggtgacgg 



WO 03/012116 PCT/IB02/03887 

tgataattcacctttaatgaataatttccgtcaatatttaccttccctccctcaatcgg 
ttgaatgtcgcccttttgtctttggcccaatacgcaaaccgcctctccccgcgcgttgg 
ccgattcattaatgcagctggcacgacaggtttcccgactggaaagcgggcagtgagcg 
caacgcaattaatgtgagttagctcactcattaggcaccccaggctttacactttatgc 
5 ttccggctcgtatgttgtgtggaattgtgagcggataacaatttcacacaggaaacagc 
tatqaccatgattacqccaaqcttgcatgcctgcag cccacagatggttagagaggctt 
acgcagcaggtctcatcaagacgatctacccgagcaataatctccaggaaatcaaatac 
cttcccaagaaggttaaagatgcagtcaaaagattcaggactaactgcatcaagaacac 
agagaaagatatatttctcaagatcagaagtactattccagtatggacgattcaaggct 

10 tgcttcacaaaccaaggcaagtaatagagattggagtctctaaaaaggtagttcccact 
gaatcaaaggccatggagtcaaagattcaaatagaggacctaacagaactcgccgtaaa 
gactggcgaacagttcatacagagtctcttacgactcaatgacaagaagaaaatcttcg 
tcaacatggtggagcacgacacacttgtctactccaaaaatatcaaagatacagtctca 
gaagaccaaagggcaattgagacttttcaacaaagggtaatatccggaaacctcctcgg 

15 attccattgcccagctatctgtcactttattgtgaagatagtggaaaaggaaggtggct 
cctacaaatgccatcattgcgataaaggaaaggccatcgttgaagatgcctctgccgac 
agtggtcccaaagatggacccccacccacgaggagcatcgtggaaaaagaagacgttcc 
aaccacgtcttcaaagcaagtggattgatgtgatatctccactgacgtaagggatgacg 
cacaatcccactatccttcgcaagacccttcctctatataaggaagttcatttcatttg 

20 gagagaacacgggg gactctagaggatccTTAATCTGTCTTCTTGTCTTCTCCATCAGT 
GGCTCGAAGCCTTTCAACAAGAGGAGGATGTGAGTAGTGATAAGCTGAGTACAATGGAT 
CAGTGTTCATTGTTGATAAGTTCTCTTCCTGTAGTTTCACTAGAGCAGGACGAAGATCT 
TTTGCATAGTCAAGCTTCACAGCAAAAGCATCAGCCTGAAACTCAAACGCTCGACTAAC 
GAGGTTCAGGCCAAAGCTTACTAGATGTTGCAGTGGTATTACAGTGTGCTGAAATATGA 

25 TCAAACCAATGAGAACAGGCTGTGTATCAAATCCGAAACTCCTGAAGAGATCAGTGGAG 
TTTCTGAGAAGAGTGTATCCTCCAAATTGTAAGAAGGCAAGGATTTGAACTGCAATGAA 
CGAGTATGTAGTGTGATTCAGTTTCCAATGTCCAAGCTCGTGTGCAATAACCGCCACAA 
TTTCATCCTCATTCTTGCACTGCTGAATCAACGTATCATAAAGAACAATCCTTTTGTTC 
TTAAAGAAACCATACATGTAAGCATTGCTATGGCTTGACCTTGTAGATCCATCGACAAC 

30 AAACAGCTTCTTCAAAGGAAACTTTAGGGAAGAAGCAAGTTTCTCAATCTTCTCCCGGA 
GGTCTCCATCTGGAAGAGGAGTGAATTTGTTGAAGAGCGGTGCTATCAAGACCGGGTAT 
ATAGTCATCATCACTAGAGACAGGATAAACATGAATGCCCACAGATAGATGGCAAGATA 
AGGACCTCCTTTCTGGACTATGAAAATTATCGCAGCAACAATGGGTGGGCCTAGTATGA 
CAGAGAGGAATGTTCCTTTGATCATGTCCCTAATGAACATCCATATTGTTTGTTTGTTG 
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AACCCATGCCGAGACTCGATCACGAAAGTTGAGTACAAAGAAAATGGCAAATCAGTGAT 
CTGTGACCATGTCATAACACCAGCCAAGAATGAAAGAGTATGCAGTATTTCATTCTCCG 
GATCAAGGCCCAACCTCGGTAAAACAGCTCCAGACATCTTCCAAAACCAAGGCAAGATC 
CCAAAGAACAAAATTGCAGAGTCCATAAGTATAGTTACAAACTCATGAACAAAGTGAAA 
ATAGCTTTTGTCAAGACTGTATGCTCGTGATTTCTCAAACTTCTCTTGGCTAATTACAC 
CAACCAAGGTTTTCGGGAGAGTTGGAAGCTTGAGAGCAGTGAGTTGCCTCAGATCCAAA 
TACGTCTCAAAAATGTACATCACTATCATAAAACCCACGACGGTTTCCATGAAAGGAAT 
CGCCATcccctcgaatttccccgatcgttcaaacatttggcaataaagtttcttaagat 
tgaatcctgttgccggtcttgcgatgattatcatataatttctgttgaattacgttaag 
catgtaataattaacatgtaatgcatgacgttatttatgagatgggtttttatgattag 
agtcccgcaattatacatttaatacgcgatagaaaacaaaatatagcgcgcaaactagg 
ataaattatcgcgcgcggtgtcatctatgttactagatcgggaattcactggccgtcgt 
tttacaacgtcgtgactgggaaaaccctggcgttacccaacttaatcgccttgcagcac 
atccccctttcgccagctggcgtaatagcgaagaggcccgcaccgatcgcccttcccaa 
cagttgcgcagcctgaatggcgcccgctcctttcgctttcttcccttcctttctcgcca 
cgttcgccggctttccccgtcaagctctaaatcgggggctccctttagggttccgattt 
agtgctttacggcacctcgaccccaaaaaacttgatttgggtgatggttcacgtagtgg 
gccatcgccctgatagacggtttttcgccctttgacgttggagtccacgttctttaata 
gtggactcttgttccaaactggaacaacactcaaccctatctcgggctattcttttgat 
ttataagggattttgccgatttcggaaccaccatcaaacaggattttcgcctgctgggg 
caaaccagcgtggaccgcttgctgcaactctctcagggccaggcggtgaagggcaatca 
gctgttgcccgtctcactggtgaaaagaaaaaccaccccagtacattaaaaacgtccgc 
aatgtgttattaagttgtctaagcgtcaatttgtttacaccacaa ta tatcctgcca 

SEQ ID NO:35 is the nucleic acid sequence of pBI121-antisense-AtCPP. 
Italicized sequences are the right and left border repeats. Underlined sequence is the 35S 
promoter. Sequence in upper case is the AtCPP anti-sense sequence. 

SEQ ID NO:36 

gtttacccgccaa ta ta tcctgt caaacact gat agtttaaact gaaggcgggaaacga 

caatctgatcatgagcggagaattaagggagtcacgttatgacccccgccgatgacgcg 

ggacaagccgttttacgtttggaactgacagaaccgcaacgttgaaggagccactcagc 

cgcgggtttctggagtttaatgagctaagcacatacgtcagaaaccattattgcgcgtt 

caaaagtcgcctaaggtcactatcagctagcaaatatttcttgtcaaaaatgctccact 

gacgttccataaattcccctcggtatccaattagagtctcatattcactctcaatccaa 

ataatctgcaccggatctggatcgtttcgcatgattgaacaagatggattgcacgcagg 
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ttctccggccgcttgggtggagaggctattcggctatgactgggcacaacagacaatcg 
gctgctctgatgccgccgtgttccggctgtcagcgcaggggcgcccggttctttttgtc 
aagaccgacctgtccggtgccctgaatgaactgcaggacgaggcagcgcggctatcgtg 
gctggccacgacgggcgttccttgcgcagctgtgctcgacgttgtcactgaagcgggaa 
gggactggctgctattgggcgaagtgccggggcaggatctcctgtcatctcaccttgct 
cctgccgagaaagtatccatcatggctgatgcaatgcggcggctgcatacgcttgatcc 
ggctacctgcccattcgaccaccaagcgaaacatcgcatcgagcgagcacgtactcgga 
tggaagccggtcttgtcgatcaggatgatctggacgaagagcatcaggggctcgcgcca 
gccgaactgttcgccaggctcaaggcgcgcatgcccgacggcgatgatctcgtcgtgac 
ccatggcgatgcctgcttgccgaatatcatggtggaaaatggccgcttttctggattca 
tcgactgtggccggctgggtgtggcggaccgctatcaggacatagcgttggctacccgt 
gatattgctgaagagcttggcggcgaatgggctgaccgcttcctcgtgctttacggtat 
cgccgctcccgattcgcagcgcatcgccttctatcgccttcttgacgagttcttctgag 
cgggactctggggttcgaaatgaccgaccaagcgacgcccaacctgccatcacgagatt 
tcgattccaccgccgccttctatgaaaggttgggcttcggaatcgttttccgggacgcc 
ggctggatgatcctccagcgcggggatctcatgctggagttcttcgcccacgggatctc 
tgcggaacaggcggtcgaaggtgccgatatcattacgacagcaacggccgacaagcaca 
acgccacgatcctgagcgacaatatgatcgggcccggcgtccacatcaacggcgtcggc 
ggcgactgcccaggcaagaccgagatgcaccgcgatatcttgctgcgttcggatatttt 
cgtggagttcccgccacagacccggatgatccccgatcgttcaaacatttggcaataaa 
gtttcttaagattgaatcctgttgccggtcttgcgatgattatcatataatttctgttg 
aattacgttaagcatgtaataattaacatgtaatgcatgacgttatttatgagatgggt 
ttttatgattagagtcccgcaattatacatttaatacgcgatagaaaacaaaatatagc 
gcgcaaactaggataaattatcgcgcgcggtgtcatctatgttactagatcgggcctcc 
tgtcaatgctggcggcggctctggtggtggttctggtggcggctctgagggtggtggct 
ctgagggtggcggttctgagggtggcggctctgagggaggcggttccggtggtggctct 
ggttccggtgattttgattatgaaaagatggcaaacgctaataagggggctatgaccga 
aaatgccgatgaaaacgcgctacagtctgacgctaaaggcaaacttgattctgtcgcta 
ctgattacggtgctgctatcgatggtttcattggtgacgtttccggccttgctaatggt 
aatggtgctactggtgattttgctggctctaattcccaaatggctcaagtcggtgacgg 
tgataattcacctttaatgaataatttccgtcaatatttaccttccctccctcaatcgg 
ttgaatgtcgcccttttgtctttggcccaatacgcaaaccgcctctccccgcgcgttgg 
ccgattcattaatgcagctggcacgacaggtttcccgactggaaagcgggcagtgagcg 
caacgcaattaatgtgagttagctcactcattaggcaccccaggctttacactttatgc 
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ttccggctcgtatgttgtgtggaattgtgagcggataacaatttcacacaggaaacagc 
tatqaccatqattacgccaaqcttqcatqcctgcag ggagccatagatgcaattcaatc 
aaactgaaatttctgcaagaatctcaaacacggagatctcaaagtttgaaagaaaattt 
atttcttcgactcaaaacaaacttacgaaatttaggtagaacttatatacattatattg 
taattttttgtaacaaaatgtttttattattattatagaattttactggttaaattaaa 
aatgaatagaaaaggtgaattaagaggagagaggaggtaaacattttcttctatttttt 
catattttcaggataaattattgtaaaagtttacaagatttccatttgactagtgtaaa 
tgaggaatattctctagtaagatcattatttcatctacttcttttatcttctaccagta 
gaggaataaacaatatttagctcctttgtaaatacaaattaattttccttcttgacatc 
attcaattttaattttacgtataaaataaaagatcatacctattagaacgattaaggag 
aaatacaattcgaatgagaaggatgtgccgtttgttataataaacagccacacgacgta 
aacgtaaaatgaccacatgatgggccaatagacatggaccgactactaataatagtaag 
ttacattttaggatggaataaatatcataccgacatcagttttgaaagaaaagggaaaa 
aaagaaaaaataaataaaagatatactaccgacatgagttccaaaaagcaaaaaaaaag 
atcaagccgacacagacacgcgtagagagcaaaatgactttgacgtcacaccacgaaaa 
cagacgcttcatacgtgtccctttatctctctcagtctctctataaacttagtgagacc 
ctcctctgttttactcacaaatatgcaaactagaaaacaatcatcaggaataaagggtt 
tgattacttctattggaaag gactctagaggatccatggcgattcc-b'b-bcaiiggaaacc 
gtcgtgggttttatgatagtgatgtacatttttgagacgtatttggatctgaggcaact 
cactgctctcaagcttccaactctcccgaaaaccttggttggtgtaattagccaagaga 
agtttgagaaatcacgagcatacagtcttgacaaaagctattttcactttgttcatgag 
tttgtaactatacttatggactctgcaattttgttctttgggatcttgccttggttttg 
gaagatgtctggagctgttttaccgaggttgggccttgatccggagaatgaaatactgc 
atactctttcattcttggctggtgttatgacatggtcacagatcactgatttgccattt 
tctttgtactcaactttcgtgatcgagtctcggcatgggttcaacaaacaaacaatatg 
gatgttcattagggacatgatcaaaggaacattcctctctgtcatactaggcccaccca 
ttgttgctgcgataattttcatagtccagaaaggaggtccttatcttgccatctatctg 
tgggcattcatgtttatcctgtctctagtgatgatgactatatacccggtcttgatagc 
accgctcttcaacaaattcactcctcttccagatggagacctccgggagaagattgaga 
aacttgcttcttccctaaagtttcctttgaagaagctgtttgttgtcgatggatctaca 
aggtcaagccatagcaatgcttacatgtatggtttctttaagaacaaaaggattgttct: 
ttatgatacgttgattcagcagtgcaagaatgaggatgaaattgtggcggttattgcac 
acgagcttggacattggaaactgaatcacactacatactcgttcattgcagttcaaatc 
cttgccttcttacaatttggaggatacactcttctcagaaactccactgatctcttcag 
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gagtttcggatttgatacacagcctgttctcattggtttgatcatatttcagcacactg 
taataccactgcaacatctagtaagctttggcctgaacctcgttagtcgagcgtttgag 
tttcaggctgatgcttttgctgtgaagcttgactatgcaaaagatcttcgtcctgctct 
agtgaaactacaggaagagaacttatcaacaatgaacactgatccattgtactcagctt 
atcactactcacatcctcctcttgttgaaaggcttcgagccactgatggagaagacaag 
aagacagattaacccctcgaatttccccgatcgttcaaacatttggcaataaagtttct 
taagattgaatcctgttgccggtcttgcgatgattatcatataatttctgttgaattac 
gttaagcatgtaataattaacatgtaatgcatgacgttatttatgagatgggtttttat 
gattagagtcccgcaattatacatttaatacgcgatagaaaacaaaatatagcgcgcaa 
actaggataaattatcgcgcgcggtgtcatctatgttactagatcgggaattcactggc 
cgtcgttttacaacgtcgtgactgggaaaaccctggcgttacccaacttaatcgccttg 
cagcacatccccctttcgccagctggcgtaatagcgaagaggcccgcaccgatcgccct 
tcccaacagttgcgcagcctgaatggcgcccgctcctttcgctttcttcccttcctttc 
tcgccacgttcgccggctttccccgtcaagctctaaatcgggggctccctttagggttc 
cgatttagtgctttacggcacctcgaccccaaaaaacttgatttgggtgatggttcacg 
tagtgggccatcgccctgatagacggtttttcgccctttgacgttggagtccacgttct 
ttaatagtggactcttgttccaaactggaacaacactcaaccctatctcgggctattct 
tttgatttataagggattttgccgatttcggaaccaccatcaaacaggattttcgcctg 
ctggggcaaaccagcgtggaccgcttgctgcaactctctcagggccaggcggtgaaggg 
caatcagctgttgcccgtctcactggtgaaaagaaaaaccaccccagtacattaaaaac 
gtccgcaatgtgttattaagttgtctaagcgtcaatttgt tta caeca caa tatatcct 
geca 

SEQ ID NO:36 is the nucleic acid sequence of RD29A-AtCPP. Italicized 
sequences are the right and left border repeats. Underlined sequence is the RD29A 
promoter. Sequence in bold is the AtCPP sense sequence. 

SEQ ID NO:37 

gtttacccgccaa ta ta tcctgtcaaacactgatagtttaaactgaaggcgggaaacga 

caatctgatcatgagcggagaattaagggagtcacgttatgacccccgccgatgacgcg 

ggacaagccgttttacgtttggaactgacagaaccgcaacgttgaaggagccactcagc 

cgcgggtttctggagtttaatgagctaagcacatacgtcagaaaccattattgcgcgtt 

caaaagtcgcctaaggtcactatcagctagcaaatatttcttgtcaaaaatgctccact 

gacgttccataaattcccctcggtatccaattagagtctcatattcactctcaatccaa 
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ataatctgcaccggatctggatcgtttcgcatgattgaacaagatggattgcacgcagg 
ttctccggccgcttgggtggagaggctattcggctatgactgggcacaacagacaatcg 
gctgctctgatgccgccgtgttccggctgtcagcgcaggggcgcccggttctttttgtc 
aagaccgacctgtccggtgccctgaatgaactgcaggacgaggcagcgcggctatcgtg 
gctggccacgacgggcgttccttgcgcagctgtgctcgacgttgtcactgaagcgggaa 
gggactggctgctattgggcgaagtgccggggcaggatctcctgtcatctcaccttgct 
cctgccgagaaagtatccatcatggctgatgcaatgcggcggctgcatacgcttgatcc 
ggctacctgcccattcgaccaccaagcgaaacatcgcatcgagcgagcacgtactcgga 
tggaagccggtcttgtcgatcaggatgatctggacgaagagcatcaggggctcgcgcca 
gccgaactgttcgccaggctcaaggcgcgcatgcccgacggcgatgatctcgtcgtgac 
ccatggcgatgcctgcttgccgaatatcatggtggaaaatggccgcttttctggattca 
tcgactgtggccggctgggtgtggcggaccgctatcaggacatagcgttggctacccgt 
gatattgctgaagagcttggcggcgaatgggctgaccgcttcctcgtgctttacggtat 
cgccgctcccgattcgcagcgcatcgccttctatcgccttcttgacgagttcttctgag 
cgggactctggggttcgaaatgaccgaccaagcgacgcccaacctgccatcacgagatt 
tcgattccaccgccgccttctatgaaaggttgggcttcggaatcgttttccgggacgcc 
ggctggatgatcctccagcgcggggatctcatgctggagttcttcgcccacgggatctc 
tgcggaacaggcggtcgaaggtgccgatatcattacgacagcaacggccgacaagcaca 
acgccacgatcctgagcgacaatatgatcgggcccggcgtccacatcaacggcgtcggc 
ggcgactgcccaggcaagaccgagatgcaccgcgatatcttgctgcgttcggatatttt 
cgtggagttcccgccacagacccggatgatccccgatcgttcaaacatttggcaataaa 
gtttcttaagattgaatcctgttgccggtcttgcgatgattatcatataatttctgttg 
aattacgttaagcatgtaataattaacatgtaatgcatgacgttatttatgagatgggt 
ttttatgattagagtcccgcaattatacatttaatacgcgatagaaaacaaaatatagc 
gcgcaaactaggataaattatcgcgcgcggtgtcatctatgttactagatcgggcctcc 
tgtcaatgctggcggcggctctggtggtggttctggtggcggctctgagggtggtggct 
ctgagggtggcggttctgagggtggcggctctgagggaggcggttccggtggtggctct 
ggttccggtgattttgattatgaaaagatggcaaacgctaataagggggctatgaccga 
aaatgccgatgaaaacgcgctacagtctgacgctaaaggcaaacttgattctgtcgcta 
ctgattacggtgctgctatcgatggtttcattggtgacgtttccggccttgctaatggt 
aatggtgctactggtgattttgctggctctaattcccaaatggctcaagtcggtgacgg 
tgataattcacctttaatgaataatttccgtcaatatttaccttccctccctcaatcgg 
ttgaatgtcgcccttttgtctttggcccaatacgcaaaccgcctctccccgcgcgttgg 
ccgattcattaatgcagctggcacgacaggtttcccgactggaaagcgggcagtgagcg 
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caacgcaattaatgtgagttagctcactcattaggcaccccaggctttacactttatgc 
ttccggctcgtatgttgtgtggaattgtgagcggataacaatttcacacaggaaacagc 
tatgaccatqattacgccaagcttgcatqcctgcag ggagccatagatgcaattcaatc 
aaactgaaatttctgcaagaatctcaaacacggagatctcaaagtttgaaagaaaattt 
atttcttcgactcaaaacaaacttacgaaatttaggtagaacttatatacattatattg 
taattttttgtaacaaaatgtttttattattattatagaattttactggttaaattaaa 
aatgaatagaaaaggtgaattaagaggagagaggaggtaaacattttcttctatttttt 
catattttcaggataaattattgtaaaagtttacaagatttccatttgactagtgtaaa 
tgaggaatattctctagtaagatcattatttcatctacttcttttatcttctaccagta 
gaggaataaacaatatttagctcctttgtaaatacaaattaattttccttcttgacatc 
attcaattttaattttacgtataaaataaaagatcatacctattagaacgattaaggag 
aaatacaattcgaatgagaaggatgtgccgtttgttataataaacagccacacgacgta 
aacgtaaaatgaccacatgatgggccaatagacatggaccgactactaataatagtaag 
ttacattttaggatggaataaatatcataccgacatcagttttgaaagaaaagggaaaa 
aaagaaaaaataaataaaagatatactaccgacatgagttccaaaaagcaaaaaaaaag 
atcaagccgacacagacacgcgtagagagcaaaatgactttgacgtcacaccacgaaaa 
cagacgcttcatacgtgtccctttatctctctcagtctctctataaacttagtgagacc 
ctcctctgttttactcacaaatatgcaaactagaaaacaatcatcaggaataaagggtt 
tgattacttctattggaaag g actctagaggatcctc ccaa tg tccaagc t eg tg tgea 
ataaccgccacaatttcatcctcattcttgcactgctgaatcaacgtatcataaagaac 
aatccttttgttcttaaagaaaccatacatgtaagcattgctatggcttgaccttgtag 
atccatcgacaacaaacagcttcttcaaaggaaactttagggaagaagcaagtttctca 
atcttctcccggaggtctccatctggaagaggagtgaatttgttgaagagcggtgctat 
caagaccgggtatatagtcatcatcactagagacaggataaacatgaatgcccacagat 
agatggcaagataaggacctcctttctggactatgaaaattatcgcagcaacaatgggt 
gggcctagtatgacagagaggaatgttcctttgatcatgtccctaatgaacatccatat 
tgtttgtttgttgaacccatgccgagactcgatcacgaaagttgagtacaaagaaaatg 
gcaaatcagtgatctgtgaccatgtcataacaccagccaagaatgaaagagtatgcagt 
atttcattctccggatcaaggcccaacctcggtaaaagaggatccccATCTACCCGCTT 
CGCGTCGGCATCCGGTCAGTGGCAGTGAAGGGCGAACAGTTCCTGATTAACCACAAACC 
GTTCTACTTTACTGGCTTTGGTCGTCATGAAGATGCGGACTTGCGTGGCAAAGGATTCG 
ATAACGTGCTGATGGTGCACGACCACGCATTAATGGACTGGATTGGGGCCAACTCCTAC 
CGTACCTCGCATTACCCTTACGCTGAAGAGATGCTCGACTGGGCAGATGAACATGGCAT 
CGTGGTGATTGATGAAACTGCTGCTGTCGGCTTTTCGCTCTCTTTAGGCATTGGTTTCG 
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AAGCGGGCAACAAGCCGAAAGAACTGTACAGCGAAGAGGCAGTCAACGGGGAAACTCAG 
CAAGCGCACTTACAGGCGATTAAAGAGCTGATAGCGCGTGACAAAAACCACCCAAGCGT 
GGTGATGTGGAGTATTGCCAACGAACCGGATACCCGTCCGCAAGGTGCACGGGAATATT 
TCGCGCCACTGGCGGAAGCAACGCGTAAACTCGACCCGACGCGTCCGATCACCTGCGTC 
5 AATGTAATGTTCTGCGACGCTCACACCGATACCATCAGCGATCTCTTTGATGTGCTGTG 
CCTGAACCGTTATTACGGATGGTATGTCCAAAGCGGCGATTTGGAAACGGCAGAGAAGG 
TACTGGAAAAAGAACTTCTGGCCTGGCAGGAGAAACTGTACACCGACATGTGGAGTGAA 
GAGTATCAGTGTGCATGGCTGGATATGTATCACCGCGTCTTTGATCGCGTCAGCGCCGT 
CGTCGGTGAACAGGTATGGAATTTCGCCGATTTTGCGACCTCGCAAGGCATATTGCGCG 
10 TTGGCGGTAACAAGAAAGGGATCTTCACTCGCGACCGCAAACCGAAGTCGGCGGCTTTT 
CTGCTGCAAAAACGCTGGACTGGCATGAACTTCGGTGAAAAACCGCAGCAGGGAGGCAA 
ACAATGAATCAACAACTCTCCTGGCGCACCATCGTCGGCTACAGCCTCGGGAATTGCTA 
CCGAGCTC 1:tt'baccgagg-btgggcct:t:gatccggagaatgaaa'bac-bgca < bact:c-b-b-b 
cattct:t:ggctggtgttat:gacatgg , bcacagatcact:ga-bt:t:gccat:tt:'fcc < b'btg < bac 
15 •bcaacbttcg-bgatcgag-bc-bcggca-bggg'ttcaacaaacaaacaatatgga-bgt-bcat: 
tagggacatgat:caaaggaaca-bt:cc'bc , bct:gt:ca-bac'baggcccacccattgt:t:gct:g 
cgataa'btttca'bag-bccagaaaggagg'bcct'batc-b-bgcca-bc-ba-bc'bgtgggcattc 
atgtittatcctgtctctagt^gatga'bgac-bata'bacccggtcttga-bagcaccgctctt: 
caacaaattcactcctcttccagatggagacctccgggagaagattgagaaacttgctt 
20 cttccctaaagt:t:t:cc , bt: < bgaagaagct:gi:-btg-b < bg'bcga-bggat:ctacaaggt:caagc 
catagcaat:gct: < bacatg < bai:ggt:t , bct:t:'baagaacaaaaggat , bgttci:ti:at:ga1:ac 
gttgattcagcagtgcaagaa-bgagga-bgaaa-b-bgtggcggtfca-btgcacacgagct-bg 
gaca-btgggag ctcgaatttccccgatcgttcaaacatttggcaataaagtttcttaag 
attgaatcctgttgccggtcttgcgatgattatcatataatttctgttgaattacgtta 
agcatgtaataattaacatgtaatgcatgacgttatttatgagatgggtttttatgatt 
agagtcccgcaattatacatttaatacgcgatagaaaacaaaatatagcgcgcaaacta 
ggataaattatcgcgcgcggtgtcatctatgttactagatcgggaattcactggccgtc 
gttttacaacgtcgtgactgggaaaaccctggcgttacccaacttaatcgccttgcagc 
acatccccctttcgccagctggcgtaatagcgaagaggcccgcaccgatcgcccttccc 
aacagttgcgcagcctgaatggcgcccgctcctttcgctttcttcccttcctttctcgc 
cacgttcgccggctttccccgtcaagctctaaatcgggggctccctttagggttccgat 
ttagtgctttacggcacctcgaccccaaaaaacttgatttgggtgatggttcacgtagt 
gggccatcgccctgatagacggtttttcgccctttgacgttggagtccacgttctttaa 
tagtggactcttgttccaaactggaacaacactcaaccctatctcgggctattcttttg 
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atttataagggattttgccgatttcggaaccaccatcaaacaggattttcgcctgctgg 
ggcaaaccagcgtggaccgcttgctgcaactctctcagggccaggcggtgaagggcaat 
cagctgttgcccgtctcactggtgaaaagaaaaaccaccccagtacattaaaaacgtcc 
gcaatgtgttattaagttgtctaagcgtcaatttgtttacaccacaa ta ta tcctgcca 

5 

SEQ ID NO:37 is the nucleic acid sequence of RD29A-HP-AtCPP. Italicized 
sequences are the right and left border repeats. Underlined sequence is the RD29A 
promoter. Sequence in bold is the AtCPP anti-sense sequence. Upper case sequence 
represents the truncated GUS fragment. Bold and underlined sequence represents the A. 
10 thaliana CaaX prenyl protease sense fragment. 

SEQIDNO:38 

gtttacccgccaatata tcctgtcaaacactgatagtttaaactgaaggcgggaaacga 
caatctgatcatgagcggagaattaagggagtcacgttatgacccccgccgatgacgcg 

15 ggacaagccgttttacgtttggaactgacagaaccgcaacgttgaaggagccactcagc 
cgcgggtttctggagtttaatgagctaagcacatacgtcagaaaccattattgcgcgtt 
caaaagtcgcctaaggtcactatcagctagcaaatatttcttgtcaaaaatgctccact 
gacgttccataaattcccctcggtatccaattagagtctcatattcactctcaatccaa 
ataatctgcaccggatctggatcgtttcgcatgattgaacaagatggattgcacgcagg 

20 ttctccggccgcttgggtggagaggctattcggctatgactgggcacaacagacaatcg 
gctgctctgatgccgccgtgttccggctgtcagcgcaggggcgcccggttctttttgtc 
aagaccgacctgtccggtgccctgaatgaactgcaggacgaggcagcgcggctatcgtg 
gctggccacgacgggcgttccttgcgcagctgtgctcgacgttgtcactgaagcgggaa 
gggactggctgctattgggcgaagtgccggggcaggatctcctgtcatctcaccttgct 

25 cctgccgagaaagtatccatcatggctgatgcaatgcggcggctgcatacgcttgatcc 
ggctacctgcccattcgaccaccaagcgaaacatcgcatcgagcgagcacgtactcgga 
tggaagccggtcttgtcgatcaggatgatctggacgaagagcatcaggggctcgcgcca 
gccgaactgttcgccaggctcaaggcgcgcatgcccgacggcgatgatctcgtcgtgac 
ccatggcgatgcctgcttgccgaatatcatggtggaaaatggccgcttttctggattca 

30 tcgactgtggccggctgggtgtggcggaccgctatcaggacatagcgttggctacccgt 
gatattgctgaagagcttggcggcgaatgggctgaccgcttcctcgtgctttacggtat 
cgccgctcccgattcgcagcgcatcgccttctatcgccttcttgacgagttcttctgag 
cgggactctggggttcgaaatgaccgaccaagcgacgcccaacctgccatcacgagatt 
tcgattccaccgccgccttctatgaaaggttgggcttcggaatcgttttccgggacgcc 
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ggctggatgatcctccagcgcggggatctcatgctggagttcttcgcccacgggatctc 
tgcggaacaggcggtcgaaggtgccgatatcattacgacagcaacggccgacaagcaca 
acgccacgatcctgagcgacaatatgatcgggcccggcgtccacatcaacggcgtcggc 
ggcgactgcccaggcaagaccgagatgcaccgcgatatcttgctgcgttcggatatttt 
5 cgtggagttcccgccacagacccggatgatccccgatcgttcaaacatttggcaataaa 
gtttcttaagattgaatcctgttgccggtcttgcgatgattatcatataatttctgttg 
aattacgttaagcatgtaataattaacatgtaatgcatgacgttatttatgagatgggt 
ttttatgattagagtcccgcaattatacatttaatacgcgatagaaaacaaaatatagc 
gcgcaaactaggataaattatcgcgcgcggtgtcatctatgttactagatcgggcctcc 

10 tgtcaatgctggcggcggctctggtggtggttctggtggcggctctgagggtggtggct 
ctgagggtggcggttctgagggtggcggctctgagggaggcggttccggtggtggctct 
ggttccggtgattttgattatgaaaagatggcaaacgctaataagggggctatgaccga 
aaatgccgatgaaaacgcgctacagtctgacgctaaaggcaaacttgattctgtcgcta 
ctgattacggtgctgctatcgatggtttcattggtgacgtttccggccttgctaatggt 

15 aatggtgctactggtgattttgctggctctaattcccaaatggctcaagtcggtgacgg 
tgataattcacctttaatgaataatttccgtcaatatttaccttccctccctcaatcgg 
ttgaatgtcgcccttttgtctttggcccaatacgcaaaccgcctctccccgcgcgttgg 
ccgattcattaatgcagctggcacgacaggtttcccgactggaaagcgggcagtgagcg 
caacgcaattaatgtgagttagctcactcattaggcaccccaggctttacactttatgc 

20 ttccggctcgtatgttgtgtggaattgtgagcggataacaatttcacacaggaaacagc 
tatgaccatgattacgccaagcttgcatgcctgcag ggagccatagatgcaattcaatc 
aaactgaaatttctgcaagaatctcaaacacggagatctcaaagtttgaaagaaaattt 
atttcttcgactcaaaacaaacttacgaaatttaggtagaacttatatacattatattg 
taattttttgtaacaaaatgtttttattattattatagaattttactggttaaattaaa 

25 aatgaatagaaaaggtgaattaagaggagagaggaggtaaacattttcttctatttttt 
catattttcaggataaattattgtaaaagtttacaagatttccatttgactagtgtaaa 
tgaggaatattctctagtaagatcattatttcatctacttcttttatcttctaccagta 
gaggaataaacaatatttagctcctttgtaaatacaaattaattttccttcttgacatc 
attcaattttaattttacgtataaaataaaagatcatacctattagaacgattaaggag 

30 aaatacaattcgaatgagaaggatgtgccgtttgttataataaacagccacacgacgta 
aacgtaaaatgaccacatgatgggccaatagacatggaccgactactaataatagtaag 
ttacattttaggatggaataaatatcataccgacatcagttttgaaagaaaagggaaaa 
aaagaaaaaataaataaaagatatactaccgacatgagttccaaaaagcaaaaaaaaag 
atcaagccgacacagacacgcgtagagagcaaaatgactttgacgtcacaccacgaaaa 
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cagacgct t ca t acgt gtcccttt at ctctctcagtctctct at aaacttagtgagacc 
ctcctctgttttactcacaaatatgcaaactagaaaacaatcatcaggaataaagggtt 
tgattacttctattggaaag gactctagaggatccTTAATCTGTCTTCTTGTCTTCTCC 
ATCAGTGGCTCGAAGCCTTTCAACAAGAGGAGGATGTGAGTAGTGATAAGCTGAGTACA 
5 ATGGATCAGTGTTCATTGTTGATAAGTTCTCTTCCTGTAGTTTCACTAGAGCAGGACGA 
AGATCTTTTGCATAGTCAAGCTTCACAGCAAAAGCATCAGCCTGAAACTCAAACGCTCG 
ACTAACGAGGTTCAGGCCAAAGCTTACTAGATGTTGCAGTGGTATTACAGTGTGCTGAA 
AT AT GAT C AAAC C AAT G AG AAC AG G C T G T G TAT C AAAT C C G AAAC T C C T G AAG AG AT C A 
GTGGAGTTTCTGAGAAGAGTGTATCCTCCAAATTGTAAGAAGGCAAGGATTTGAACTGC 

10 AATGAACGAGTATGTAGTGTGATTCAGTTTCCAATGTCCAAGCTCGTGTGCAATAACCG 
CCACAATTTCATCCTCATTCTTGCACTGCTGAATCAACGTATCATAAAGAACAATCCTT 
TTGTTCTTAAAGAAACCATACATGTAAGCATTGCTATGGCTTGACCTTGTAGATCCATC 
GACAACAAACAGCTTCTTCAAAGGAAACTTTAGGGAAGAAGCAAGTTTCTCAATCTTCT 
CCCGGAGGTCTCCATCTGGAAGAGGAGTGAATTTGTTGAAGAGCGGTGCTATCAAGACC 

15 GGGTATATAGTCATCATCACTAGAGACAGGATAAACATGAATGCCCACAGATAGATGGC 
AAGATAAGGACCTCCTTTCTGGACTATGAAAATTATCGCAGCAACAATGGGTGGGCCTA 
GTATGACAGAGAGGAATGTTCCTTTGATCATGTCCCTAATGAACATCCATATTGTTTGT 
TTGTTGAACCCATGCCGAGACTCGATCACGAAAGTTGAGTACAAAGAAAATGGCAAATC 
AGTGATCTGTGACCATGTCATAACACCAGCCAAGAATGAAAGAGTATGCAGTATTTCAT 

20 TCTCCGGATCAAGGCCCAACCTCGGTAAAACAGCTCCAGACATCTTCCAAAACCAAGGC 
AAGATCCCAAAGAACAAAATTGCAGAGTCCATAAGTATAGTTACAAACTCATGAACAAA 
GTGAAAATAGCTTTTGTCAAGACTGTATGCTCGTGATTTCTCAAACTTCTCTTGGCTAA 
TTACACCAACCAAGGTTTTCGGGAGAGTTGGAAGCTTGAGAGCAGTGAGTTGCCTCAGA 
TCCAAATACGTCTCAAAAATGTACATCACTATCATAAAACCCACGACGGTTTCCATGAA 

25 AGGAATCGCCATcccctcgaatttccccgatcgttcaaacatttggcaataaagtttct 
taagattgaatcctgttgccggtcttgcgatgattatcatataatttctgttgaattac 
gttaagcatgtaataattaacatgtaatgcatgacgttatttatgagatgggtttttat 
gattagagtcccgcaattatacatttaatacgcgatagaaaacaaaatatagcgcgcaa 
actaggataaattatcgcgcgcggtgtcatctatgttactagatcgggaattcactggc 

30 cgtcgttttacaacgtcgtgactgggaaaaccctggcgttacccaacttaatcgccttg 
cagcacatccccctttcgccagctggcgtaatagcgaagaggcccgcaccgatcgccct 
tcccaacagttgcgcagcctgaatggcgcccgctcctttcgctttcttcccttcctttc 
tcgccacgttcgccggctttccccgtcaagctctaaatcgggggctccctttagggttc 
cgatttagtgctttacggcacctcgaccccaaaaaacttgatttgggtgatggttcacg 
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tagtgggccatcgccctgatagacggtttttcgccctttgacgttggagtccacgttct 
ttaatagtggactcttgttccaaactggaacaacactcaaccctatctcgggctattct 
tttgatttataagggattttgccgatttcggaaccaccatcaaacaggattttcgcctg 
ctggggcaaaccagcgtggaccgcttgctgcaactctctcagggccaggcggtgaaggg 
caatcagctgttgcccgtctcactggtgaaaagaaaaaccaccccagtacattaaaaac 
gtccgcaatgtgttattaagttgtctaagcgtcaatttgt tta caeca caata tatcct 
geca 

SEQ ID NO:38 is the nucleic acid sequence of RD29A-antisense-AtCPP. 
Italicized sequences are the right and left border repeats. Underlined sequence is the 
RD29A promoter. Sequence in upper case sequence is the AtCPP anti-sense sequence. 

SEQ ID NO:39 

gtttacccgccaa ta ta tcctgt caaacact gat agtttaaactgaaggcgggaaacga 
caatctgatcatgagcggagaattaagggagtcacgttatgacccccgccgatgacgcg 
ggacaagccgttttacgtttggaactgacagaaccgcaacgttgaaggagccactcagc 
cgcgggtttctggagtttaatgagctaagcacatacgtcagaaaccattattgcgcgtt 
caaaagtcgcctaaggtcactatcagctagcaaatatttcttgtcaaaaatgctccact 
gacgttccataaattcccctcggtatccaattagagtctcatattcactctcaatccaa 
ataatctgcaccggatctggatcgtttcgcatgattgaacaagatggattgcacgcagg 
ttctccggccgcttgggtggagaggctattcggctatgactgggcacaacagacaatcg 
gctgctctgatgccgccgtgttccggctgtcagcgcaggggcgcccggttctttttgtc 
aagaccgacctgtccggtgccctgaatgaactgcaggacgaggcagcgcggctatcgtg 
gctggccacgacgggcgttccttgcgcagctgtgctcgacgttgtcactgaagcgggaa 
gggactggctgctattgggcgaagtgccggggcaggatctcctgtcatctcaccttgct 
cctgccgagaaagtatccatcatggctgatgcaatgcggcggctgcatacgcttgatcc 
ggctacctgcccattcgaccaccaagcgaaacatcgcatcgagcgagcacgtactcgga 
tggaagccggtcttgtcgatcaggatgatctggacgaagagcatcaggggctcgcgcca 
gccgaactgttcgccaggctcaaggcgcgcatgcccgacggcgatgatctcgtcgtgac 
ccatggcgatgcctgcttgccgaatatcatggtggaaaatggccgcttttctggattca 
tcgactgtggccggctgggtgtggcggaccgctatcaggacatagcgttggctacccgt 
gatattgctgaagagcttggcggcgaatgggctgaccgcttcctcgtgctttacggtat 
cgccgctcccgattcgcagcgcatcgccttctatcgccttcttgacgagttcttctgag 
cgggactctggggttcgaaatgaccgaccaagcgacgcccaacctgccatcacgagatt 
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tcgattccaccgccgccttctatgaaaggttgggcttcggaatcgttttccgggacgcc 
ggct gga t gatcctccagcgcggggatc teat get ggagttcttcgcccacgggat etc 
tgcggaacaggcggtcgaaggtgccgatatcattacgacagcaacggccgacaagcaca 
acgccacgatcctgagcgacaatatgatcgggcccggcgtccacatcaacggcgtcggc 
5 ggcgactgcccaggcaagaccgagatgcaccgcgatatcttgctgcgttcggatatttt 
cgtggagttcccgccacagacccggatgatccccgatcgttcaaacatttggcaataaa 
gtttcttaagattgaatcctgttgccggtcttgcgatgattatcatataatttctgttg 
aattacgttaagcatgtaataattaacatgtaatgcatgacgttatttatgagatgggt 
ttttatgattagagtcccgcaattatacatttaatacgcgatagaaaacaaaatatagc 

10 gcgcaaactaggataaattatcgcgcgcggtgtcatctatgttactagatcgggcctcc 
tgtcaat get ggcggcggctctggtggtggttctggtggcggctctgagggtggt ggct 
ctgagggtggcggttctgagggtggcggctctgagggaggcggttccggtggtggctct 
ggttccggtgattttgattatgaaaagatggcaaacgctaataagggggctatgaccga 
aaatgccgatgaaaacgcgctacagtctgacgctaaaggcaaacttgattctgtcgcta 

15 ctgattacggtgctgctatcgatggtttcattggtgacgtttccggccttgctaatggt 
aatggt get act ggtgatttt get ggct ctaattcccaaat ggct caagteggtgaegg 
tgataattcacctttaatgaataatttccgtcaatatttaccttccctccctcaatcgg 
ttgaatgtcgcccttttgtctttggcccaatacgcaaaccgcctctccccgcgcgttgg 
ccgattcattaatgcagctggcacgacaggtttcccgactggaaagcgggcagtgagcg 

20 caacgcaattaatgtgagttagctcactcattaggcaccccaggctttacactttatgc 
ttccggctcgtatgttgtgtggaattgtgagcggataacaatttcacacaggaaacagc 
tatgaccatgattacgccaagctGGGAAATTTTTCGCCAGTTCTAAATATCCGGAAACC 
TCTTGGGATGCCATTGCCCATCTATCTGTAATTTATTGACGAAATAGACGAAAAGGAAG 
GTGGCTCCTATAAAGCACATCATTGCGATAACAGAAAGGCCATTGTTGAAGATACCTCT 

25 GCTGACATTGGTCCCCAAGTGGAAGCACCACCCCATGAGGAGCACCGTGGAGTAAGAAG 
ACGTTCGAGCCACGTCGAAAAAGCAAGTGTGTTGATGTAGTATCTCCATTGACGTAAGG 
GATGACGCACAATCCAACTATCCATCGCAAGACCATTGCTCTATATAAGAAAGTTAATA 
TCATTTCGAGTGGCCACGCTGAGGGGGATCCatggcgattcctttcatggaaaccgtcg 
tgggttttatgatagtgatgtacatttttgagacgtatttggatctgaggcaactcact 

30 gctctcaagcttccaactctcccgaaaaccttggttggtgtaattagccaagagaagtt 
tgagaaatcacgagcatacagtcttgacaaaagctattttcactttgttcatgagtttg 
taactatacttatggactctgcaattttgttctttgggatcttgccttggttttggaag 
atgtctggagctgttttaccgaggttgggccttgatccggagaatgaaatactgcatac 
tctttcattcttggctggtgttatgacatggtcacagatcactgatttgccattttctt 
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tgtactcaactttcgtgatcgagtctcggcatgggttcaacaaacaaacaatatggatg 
ttcattagggacatgatcaaaggaacattcctctctgtcatactaggcccacccattgt 
tgctgcgataattttcatagtccagaaaggaggtccttatcttgccatctatctgtggg 
cattcatgtttatcctgtctctagtgatgatgactatatacccggtcttgatagcaccg 
ctcttcaacaaattcactcctcttccagatggagacctccgggagaagattgagaaact 
tgcttcttccctaaagtttcctttgaagaagctgtttgttgtcgatggatctacaaggt 
caagccatagcaatgcttacatgtatggtttctttaagaacaaaaggattgttctttat 
gatacgttgattcagcagtgcaagaatgaggatgaaattgtggcggttattgcacacga 
gcttggacattggaaactgaatcacactacatactcgttcattgcagttcaaatccttg 
ccttcttacaatttggaggatacactcttctcagaaactccactgatctcttcaggagt 
ttcggatttgatacacagcctgttctcattggtttgatcatatttcagcacactgtaat 
accactgcaacatctagtaagctttggcctgaacctcgttagtcgagcgtttgagtttc 
aggctgatgcttttgctgtgaagcttgactatgcaaaagatcttcgtcctgctctagtg 
aaactacaggaagagaacttatcaacaatgaacactgatccattgtactcagcttatca 
ctactcacatcctcctcttgttgaaaggcttcgagccactgatggagaagacaagaaga 
cagattaacccctcgaatttccccgatcgttcaaacatttggcaataaagtttcttaag 
attgaatcctgttgccggtcttgcgatgattatcatataatttctgttgaattacgtta 
agcatgtaataattaacatgtaatgcatgacgttatttatgagatgggtttttatgatt 
agagtcccgcaattatacatttaatacgcgatagaaaacaaaatatagcgcgcaaacta 
ggataaattatcgcgcgcggtgtcatctatgttactagatcgggaattcactggccgtc 
gttttacaacgtcgtgactgggaaaaccctggcgttacccaacttaatcgccttgcagc 
acatccccctttcgccagctggcgtaatagcgaagaggcccgcaccgatcgcccttccc 
aacagttgcgcagcctgaatggcgcccgctcctttcgctttcttcccttcctttctcgc 
cacgttcgccggctttccccgtcaagctctaaatcgggggctccctttagggttccgat 
ttagtgctttacggcacctcgaccccaaaaaacttgatttgggtgatggttcacgtagt 
gggccatcgccctgatagacggtttttcgccctttgacgttggagtccacgttctttaa 
tagtggactcttgttccaaactggaacaacactcaaccctatctcgggctattcttttg 
atttataagggattttgccgatttcggaaccaccatcaaacaggattttcgcctgctgg 
ggcaaaccagcgtggaccgcttgctgcaactctctcagggccaggcggtgaagggcaat 
cagctgttgcccgtctcactggtgaaaagaaaaaccaccccagtacattaaaaacgtcc 
gcaatgtgttattaagttgtctaagcgtcaatttgtttacaccacaa ta ta tcctgcca 
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SEQ ID NO:39 is the nucleic acid sequence of MuA-AtCPP. Italicized sequences 
are the right and left border repeats. Sequence in upper case is the MuA promoter. The A. 

o 

thaliana CaaX prenyl protease sense sequence is in bold. 
5 SEQ ID NO:40 

gt t tacccgccaat a tatcct gt caaacactgatagtttaaactgaaggcgggaaacga 
caatctgatcatgagcggagaattaagggagtcacgttatgacccccgccgatgacgcg 
ggacaagccgttttacgtttggaactgacagaaccgcaacgttgaaggagccactcagc 
cgcgggtttctggagtttaatgagctaagcacatacgtcagaaaccattattgcgcgtt 

10 caaaagtcgcctaaggtcactatcagctagcaaatatttcttgtcaaaaatgctccact 
gacgttccataaattcccctcggtatccaattagagtctcatattcactctcaatccaa 
ataatctgcaccggatctggatcgtttcgcatgattgaacaagatggattgcacgcagg 
ttctccggccgcttgggtggagaggctattcggctatgactgggcacaacagacaatcg 
gctgctctgatgccgccgtgttccggctgtcagcgcaggggcgcccggttctttttgtc 

15 aagaccgacctgtccggtgccctgaatgaactgcaggacgaggcagcgcggctatcgtg 
gctggccacgacgggcgttccttgcgcagctgtgctcgacgttgtcactgaagcgggaa 
gggactggctgctattgggcgaagtgccggggcaggatctcctgtcatctcaccttgct 
cctgccgagaaagtatccatcatggctgatgcaatgcggcggctgcatacgcttgatcc 
ggctacctgcccattcgaccaccaagcgaaacatcgcatcgagcgagcacgtactcgga 

20 tggaagccggtcttgtcgatcaggatgatctggacgaagagcatcaggggctcgcgcca 
gccgaactgttcgccaggctcaaggcgcgcatgcccgacggcgatgatctcgtcgtgac 
ccatggcgatgcctgcttgccgaatatcatggtggaaaatggccgcttttctggattca 
tcgactgtggccggctgggtgtggcggaccgctatcaggacatagcgttggctacccgt 
gatattgctgaagagcttggcggcgaatgggctgaccgcttcctcgtgctttacggtat 

25 cgccgctcccgattcgcagcgcatcgccttctatcgccttcttgacgagttcttctgag 
cgggactctggggttcgaaatgaccgaccaagcgacgcccaacctgccatcacgagatt 
tcgattccaccgccgccttctatgaaaggttgggcttcggaatcgttttccgggacgcc 
ggctggatgatcctccagcgcggggatctcatgctggagttcttcgcccacgggatctc 
tgcggaacaggcggtcgaaggtgccgatatcattacgacagcaacggccgacaagcaca 

30 acgccacgatcctgagcgacaatatgatcgggcccggcgtccacatcaacggcgtcggc 
ggcgactgcccaggcaagaccgagatgcaccgcgatatcttgctgcgttcggatatttt 
cgtggagttcccgccacagacccggatgatccccgatcgttcaaacatttggcaataaa 
gtttcttaagattgaatcctgttgccggtcttgcgatgattatcatataatttctgttg 
aattacgttaagcatgtaataattaacatgtaatgcatgacgttatttatgagatgggt 
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ttttatgattagagtcccgcaattatacatttaatacgcgatagaaaacaaaatatagc 
gcgcaaactaggataaattatcgcgcgcggtgtcatctatgttactagatcgggcctcc 
tgtcaatgctggcggcggctctggtggtggttctggtggcggctctgagggtggtggct 
ctgagggtggcggttctgagggtggcggctctgagggaggcggttccggtggtggctct 
5 ggttccggtgattttgattatgaaaagatggcaaacgctaataagggggctatgaccga 
aaatgccgatgaaaacgcgctacagtctgacgctaaaggcaaacttgattctgtcgcta 
ctgattacggtgctgctatcgatggtttcattggtgacgtttccggccttgctaatggt 
aatggtgctactggtgattttgctggctctaattcccaaatggctcaagtcggtgacgg 
tgataattcacctttaatgaataatttccgtcaatatttaccttccctccctcaatcgg 

10 ttgaatgtcgcccttttgtctttggcccaatacgcaaaccgcctctccccgcgcgttgg 
ccgattcattaatgcagctggcacgacaggtttcccgactggaaagcgggcagtgagcg 
caacgcaattaatgtgagttagctcactcattaggcaccccaggctttacactttatgc 
ttccggctcgtatgttgtgtggaattgtgagcggataacaatttcacacaggaaacagc 
tatgaccatgattacgccaagctGGGAAATTTTTCGCCAGTTCTAAATATCCGGAAACC 

15 TCTTGGGATGCCATTGCCCATCTATCTGTAATTTATTGACGAAATAGACGAAAAGGAAG 
GTGGCTCCTATAAAGCACATCATTGCGATAACAGAAAGGCCATTGTTGAAGATACCTCT 
GCTGACATTGGTCCCCAAGTGGAAGCACCACCCCATGAGGAGCACCGTGGAGTAAGAAG 
ACGTTCGAGCCACGTCGAAAAAGCAAGTGTGTTGATGTAGTATCTCCATTGACGTAAGG 
GATGACGCACAATCCAACTATCCATCGCAAGACCATTGCTCTATATAAGAAAGTTAATA 

20 TCATTTCGAGTGGCCACGCTGAGGGGGATCGGGATGGCGTTTCCCTACATGGAAGCCGT 
TGTCGGATTTATGATATTAATGTACATTTTTGAAACTTACTTGGATGTGCGACAACATA 
GGGCCCTCAAACTTCCTACTCTTCCAAAGACTTTAGAGGGTGTTATCAGCCAAGAGAAA 
TTTGAGAAATCTAGAGCCTATAGTCTTGATAAAAGCCACTTCCATTTTGTTCACGAGTT 
TGTGACAATAGTGACAGACTCTACAATTTTGTACTTTGGGGTATTGCCCTGGTTTTGGA 

25 AGAAATCAGGAGATTTTATGACAATAGCTGGTTTCAATGCTGAGAATGAAATACTGCAT 
ACCCTTGCCTTCTTAGCAGGGCTGATGATTTGGTCACAGATAACAGATTTGCCCTTTTC 
TCTGTACTCAACTTTTGTGATTGAGGCCCGTCATGGTTTTAATAAGCAAACACCATGGT 
TATTCTTTAGGGACATGCTTAAAGGAATTTTCCTTTCTGTAATAATTGGTCCACCTATT 
GTGGCTGCAATCATTGTAATAGTACAGAAAGGAGGTCCATACTTGGCCATCTATCTTTG 

30 GGTTTTTACGTTTGGTCTTTCTATTGTGATGATGACCCTTTATCCAGTACTAATAGCTC 
CACTCTTCAATAAGTTCACTCCACTTCCAGATGGTCAACTCAGGGAGAAAATCGAGAAA 
CTTGCTTCCTCCCTCAACTATCCGTTAAAGAAACTATTTGTTGTCGATGGATCCACAAG 
ATCAAGTC^CAGCAATGCCTATATGTATGGATTCTTCAAGAAC^ 

ATGACACATTAATTCAACAGTGCAAAGACGATGAGGAAATTGTTGCTGTTATTGCCCAT 
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GAGTTGGGACACTGGAAGCTCAACCATACTGTGTACACATTTGTTGCTATGCAGATTCT 
TACACTTCTACAATTTGGAGGATATACACTAGTGCGAAATTCAGCTGATCTGTATCGAA 
GCTTTGGGTTTGATACGCAGCCAGTCCTCATTGGGCTCATCATATTTCAGCATACTGTA 
ATCCCACTTCAGCAATTGGTCAGCTTTGGTCTGAACCTAGTCAGCCGATCATTTGAATT 
5 TCAGGCTGATGGCTTTGCCAAGAAGCTTGGATATGCATCTGGATTACGCGGTGGTCTTG 
TGAAACTACAGGAGGAGAATCTGTCAGCTATGAATACAGATCCTTGGTACTCTGCTTAT 
CACTATTCTCATCCTCCCCTTGTTGAAAGATTGGCCGCGCTGGACGT^ACCGGATAAGAA 
GGAAGACTAAgagctcgaatttccccgatcgttcaaacatttggcaataaagtttctta 
agattgaatcctgttgccggtcttgcgatgattatcatataatttctgttgaattacgt 

10 taagcatgtaataattaacatgtaatgcatgacgttatttatgagatgggtttttatga 
ttagagtcccgcaattatacatttaatacgcgatagaaaacaaaatatagcgcgcaaac 
taggataaattatcgcgcgcggtgtcatctatgttactagatcgggaattcactggccg 
tcgttttacaacgtcgtgactgggaaaaccctggcgttacccaacttaatcgccttgca 
gcacatccccctttcgccagctggcgtaatagcgaagaggcccgcaccgatcgcccttc 

15 ccaacagttgcgcagcctgaatggcgcccgctcctttcgctttcttcccttcctttctc 
gccacgttcgccggctttccccgtcaagctctaaatcgggggctccctttagggttccg 
atttagtgctttacggcacctcgaccccaaaaaacttgatttgggtgatggttcacgta 
gtgggccatcgccctgatagacggtttttcgccctttgacgttggagtccacgttcttt 
aatagtggactcttgttccaaactggaacaacactcaaccctatctcgggctattcttt 

20 tgatttataagggattttgccgatttcggaaccaccatcaaacaggattttcgcctgct 
ggggcaaaccagcgtggaccgcttgctgcaactctctcagggccaggcggtgaagggca 
atcagctgttgcccgtctcactggtgaaaagaaaaaccaccccagtacattaaaaacgt 
ccgcaatgtgttattaagttgtctaagcgtcaatttgtttacaccacaa ta ta tcctgc 
ca 

25 

SEQ ID NO:40 is the nucleic acid sequence of MuA-GmCPP. Italicized 
sequences are the right and left border repeats. Sequence in upper case is the MuA 
promoter. The G. max CaaX prenyl protease sense sequence is in upper case and bold. 



30 

SEQ ID NO:41 

gtttacccgccaa ta ta tcctgt caaacact gat agtttaaactgaaggcgggaaacga 
caatctgatcatgagcggagaattaagggagtcacgttatgacccccgccgatgacgcg 
ggacaagccgttttacgtttggaactgacagaaccgcaacgttgaaggagccactcagc 
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cgcgggtttctggagtttaatgagctaagcacatacgtcagaaaccattattgcgcgtt 
caaaagtcgcctaaggtcactatcagctagcaaatatttcttgtcaaaaatgctccact 
gacgttccataaattcccctcggtatccaattagagtctcatattcactctcaatccaa 
ataatctgcaccggatctggatcgtttcgcatgattgaacaagatggattgcacgcagg 
ttctccggccgcttgggtggagaggctattcggctatgactgggcacaacagacaatcg 
gctgctctgatgccgccgtgttccggctgtcagcgcaggggcgcccggttctttttgtc 
aagaccgacctgtccggtgccctgaatgaactgcaggacgaggcagcgcggctatcgtg 
gctggccacgacgggcgttccttgcgcagctgtgctcgacgttgtcactgaagcgggaa 
gggactggctgctattgggcgaagtgccggggcaggatctcctgtcatctcaccttgct 
cctgccgagaaagtatccatcatggctgatgcaatgcggcggctgcatacgcttgatcc 
ggctacctgcccattcgaccaccaagcgaaacatcgcatcgagcgagcacgtactcgga 
tggaagccggtcttgtcgatcaggatgatctggacgaagagcatcaggggctcgcgcca 
gccgaactgttcgccaggctcaaggcgcgcatgcccgacggcgatgatctcgtcgtgac 
ccatggcgatgcctgcttgccgaatatcatggtggaaaatggccgcttttctggattca 
tcgactgtggccggctgggtgtggcggaccgctatcaggacatagcgttggctacccgt 
gatattgctgaagagcttggcggcgaatgggctgaccgcttcctcgtgctttacggtat 
cgccgctcccgattcgcagcgcatcgccttctatcgccttcttgacgagttcttctgag 
cgggactctggggttcgaaatgaccgaccaagcgacgcccaacctgccatcacgagatt 
tcgattccaccgccgccttctatgaaaggttgggcttcggaatcgttttccgggacgcc 
ggctggatgatcctccagcgcggggatctcatgctggagttcttcgcccacgggatctc 
tgcggaacaggcggtcgaaggtgccgatatcattacgacagcaacggccgacaagcaca 
acgccacgatcctgagcgacaatatgatcgggcccggcgtccacatcaacggcgtcggc 
ggcgactgcccaggcaagaccgagatgcaccgcgatatcttgctgcgttcggatatttt 
cgtggagttcccgccacagacccggatgatccccgatcgttcaaacatttggcaataaa 
gtttcttaagattgaatcctgttgccggtcttgcgatgattatcatataatttctgttg 
aattacgttaagcatgtaataattaacatgtaatgcatgacgttatttatgagatgggt 
ttttatgattagagtcccgcaattatacatttaatacgcgatagaaaacaaaatatagc 
gcgcaaactaggataaattatcgcgcgcggtgtcatctatgttactagatcgggcctcc 
tgtcaatgctggcggcggctctggtggtggttctggtggcggctctgagggtggtggct 
ctgagggtggcggttctgagggtggcggctctgagggaggcggttccggtggtggctct 
ggttccggtgattttgattatgaaaagatggcaaacgctaataagggggctatgaccga 
aaatgccgatgaaaacgcgctacagtctgacgctaaaggcaaacttgattctgtcgcta 
ctgattacggtgctgctatcgatggtttcattggtgacgtttccggccttgctaatggt 
aatggtgctactggtgattttgctggctctaattcccaaatggctcaagtcggtgacgg 
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tgataattcacctttaatgaataatttccgtcaatatttaccttccctccctcaatcgg 
ttgaatgtcgcccttttgtctttggcccaatacgcaaaccgcctctccccgcgcgttgg 
ccgattcattaatgcagctggcacgacaggtttcccgactggaaagcgggcagtgagcg 
caacgcaattaatgtgagttagctcactcattaggcaccccaggctttacactttatgc 
ttccggctcgtatgttgtgtggaattgtgagcggataacaatttcacacaggaaacagc 
tatqaccatqattacqccaaqcttgcatgcctgcag cccacaqatqqttagagaggctt 
acgcagcaggtctcatcaagacgatctacccgagcaataatctccaggaaatcaaatac 
cttcccaagaaggttaaagatgcagtcaaaagattcaggactaactgcatcaagaacac 
agagaaagatatatttctcaagatcagaagtactattccagtatggacgattcaaggct 
tgcttcacaaaccaaggcaagtaatagagattggagtctctaaaaaggtagttcccact 
gaatcaaaggccatggagtcaaagattcaaatagaggacctaacagaactcgccgtaaa 
gactggcgaacagttcatacagagtctcttacgactcaatgacaagaagaaaatcttcg 
tcaacatggtggagcacgacacacttgtctactccaaaaatatcaaagatacagtctca 
gaagaccaaaqqgcaattqaqacttttcaacaaagggtaatatccggaaacctcctcgg 
attccattgcccagctatctgtcactttattgtgaagatagtggaaaaggaaggtggct 
cctacaaatgccatcattgcgataaaggaaaggccatcgttgaagatgcctctgccgac 
agtggtcccaaagatggacccccacccacgaggagcatcgtggaaaaagaagacgttcc 
aaccacgtcttcaaagcaagtqqattgatgtgatatctccactgacgtaagggatgacg 
cacaatcccactatccttcgcaagacccttcctctatataaggaagttcatttcatttg 
gaqagaacacgggg gactctagaggatccccgggatggcg-b-btccctacatggaagccg 
ttgtcggatttatgatattaatgtacatttttgaaacttacttggatgtgcgacaacat 
agggccctcaaacttcctactcttccaaagactttagagggtgttatcagccaagagaa 
atttgagaaatctagagcctatagtcttgataaaagccacttcca-tt-ttgttcacgagt 
ttgtgacaatagtgacagactctacaatt-ttgtactttggggtattgccctggttttgg 
aagaaatcaggagattttatgacaatagctggtttcaatgctgagaatgaaatactgca 
tacccttgccttcttagcagggctgatgatttggtcacagataacagatttgccctttt 
ctctgtactcaacttttgtgattgaggcccgtcatggttttaataagcaaacaccatgg 
ttattctttagggacatgcttaaaggaattttcctttctgtaataattggtccacctat 
tgtggctgcaatcattgtaatagtacagaaaggaggtccatacttggccatctatcttt 
gggtttttacgtttggtctttc-tattgtgatgatgaccctttatccagtactaatagct: 
ccactcttcaataagttcactccacttccagatggtcaactcagggagaaaatcgagaa 
acttgcttcctccctcaactatccgttaaagaaactatttgttgtcgatggatccacaa 
gatcaagtcacagcaatgcctatatgtatggattcttcaagaacaagaggattgtccct 
tatgacacattaattcaacagtgcaaagacgatgaggaaattgttgctgttattgccca 
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tgagttgggacactggaagctcaaccatactgtgtacacatttgttgctatgcagattc 
ttacacttctacaatttggaggatatacactagtgcgaaattcagctgatctgtatcga 
agctttgggtttgatacgcagccagtcctcattgggctcatcatatttcagcatactgt 
aatcccacttcagcaattggtcagctttggtctgaacctagtcagccgatcatttgaat 
ttcaggctgatggctttgccaagaagcttggatatgcatctggattacgcggtggtctt 
gtgaaactacaggaggagaatctgtcagctatgaatacagatccttggtactctgctta 
tcactattctcatcctccccttgttgaaagattggccgcgctggacgaaccggataaga 
aggaagactaagagctcgaatttccccgatcgttcaaacatttggcaataaagtttctt 
aagattgaatcctgttgccggtcttgcgatgattatcatataatttctgttgaattacg 
ttaagcatgtaataattaacatgtaatgcatgacgttatttatgagatgggtttttatg 
attagagtcccgcaattatacatttaatacgcgatagaaaacaaaatatagcgcgcaaa 
ctaggataaattatcgcgcgcggtgtcatctatgttactagatcgggaattcactggcc 
gtcgttttacaacgtcgtgactgggaaaaccctggcgttacccaacttaatcgccttgc 
agcacatccccctttcgccagctggcgtaatagcgaagaggcccgcaccgatcgccctt 
cccaacagttgcgcagcctgaatggcgcccgctcctttcgctttcttcccttcctttct 
cgccacgttcgccggctttccccgtcaagctctaaatcgggggctccctttagggttcc 
gatttagtgctttacggcacctcgaccccaaaaaacttgatttgggtgatggttcacgt 
agtgggccatcgccctgatagacggtttttcgccctttgacgttggagtccacgttctt 
taatagtggactcttgttccaaactggaacaacactcaaccctatctcgggctattctt 
ttgatttataagggattttgccgatttcggaaccaccatcaaacaggattttcgcctgc 
tggggcaaaccagcgtggaccgcttgctgcaactctctcagggccaggcggtgaagggc 
aatcagctgttgcccgtctcactggtgaaaagaaaaaccaccccagtacattaaaaacg 
tccgcaatgtgttattaagttgtctaagcgtcaatttgtttacaccacaa ta ta tcctg 
cca 

SEQ ID NO:41 is the nucleic acid sequence of pBI121-GmCPP. Italicized 
sequences are the right and left border repeats. Underlined sequence is the 35S promoter. 
The G. max CaaX prenyl protease sense sequence is in bold. 

SEQ ID NO:42 

gtttacccgccaatata tcctgt caaacact gat agtttaaactgaaggcgggaaacga 
caatctgatcatgagcggagaattaagggagtcacgttatgacccccgccgatgacgcg 
ggacaagccgttttacgtttggaactgacagaaccgcaacgttgaaggagccactcagc 
cgcgggtttctggagtttaatgagctaagcacatacgtcagaaaccattattgcgcgtt 
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caaaagtcgcctaaggtcactatcagctagcaaatatttcttgtcaaaaatgctccact 
gacgttccataaattcccctcggtatccaattagagtctcatattcactctcaatccaa 
ataatctgcaccggatctggatcgtttcgcatgattgaacaagatggattgcacgcagg 
ttctccggccgcttgggtggagaggctattcggctatgactgggcacaacagacaatcg 
gctgctctgatgccgccgtgttccggctgtcagcgcaggggcgcccggttctttttgtc 
aagaccgacctgtccggtgccctgaatgaactgcaggacgaggcagcgcggctatcgtg 
gctggccacgacgggcgttccttgcgcagctgtgctcgacgttgtcactgaagcgggaa 
gggactggctgctattgggcgaagtgccggggcaggatctcctgtcatctcaccttgct 
cctgccgagaaagtatccatcatggctgatgcaatgcggcggctgcatacgcttgatcc 
ggctacctgcccattcgaccaccaagcgaaacatcgcatcgagcgagcacgtactcgga 
tggaagccggtcttgtcgatcaggatgatctggacgaagagcatcaggggctcgcgcca 
gccgaactgttcgccaggctcaaggcgcgcatgcccgacggcgatgatctcgtcgtgac 
ccatggcgatgcctgcttgccgaatatcatggtggaaaatggccgcttttctggattca 
tcgactgtggccggctgggtgtggcggaccgctatcaggacatagcgttggctacccgt 
gatattgctgaagagcttggcggcgaatgggctgaccgcttcctcgtgctttacggtat 
cgccgctcccgattcgcagcgcatcgccttctatcgccttcttgacgagttcttctgag 
cgggactctggggttcgaaatgaccgaccaagcgacgcccaacctgccatcacgagatt 
tcgattccaccgccgccttctatgaaaggttgggcttcggaatcgttttccgggacgcc 
ggctggatgatcctccagcgcggggatctcatgctggagttcttcgcccacgggatctc 
tgcggaacaggcggtcgaaggtgccgatatcattacgacagcaacggccgacaagcaca 
acgccacgatcctgagcgacaatatgatcgggcccggcgtccacatcaacggcgtcggc 
ggcgactgcccaggcaagaccgagatgcaccgcgatatcttgctgcgttcggatatttt 
cgtggagttcccgccacagacccggatgatccccgatcgttcaaacatttggcaataaa 
gtttcttaagattgaatcctgttgccggtcttgcgatgattatcatataatttctgttg 
aattacgttaagcatgtaataattaacatgtaatgcatgacgttatttatgagatgggt 
ttttatgattagagtcccgcaattatacatttaatacgcgatagaaaacaaaatatagc 
gcgcaaactaggataaattatcgcgcgcggtgtcatctatgttactagatcgggcctcc 
tgtcaatgctggcggcggctctggtggtggttctggtggcggctctgagggtggtggct 
ctgagggtggcggttctgagggtggcggctctgagggaggcggttccggtggtggctct 
ggttccggtgattttgattatgaaaagatggcaaacgctaataagggggctatgaccga 
aaatgccgatgaaaacgcgctacagtctgacgctaaaggcaaacttgattctgtcgcta 
ctgattacggtgctgctatcgatggtttcattggtgacgtttccggccttgctaatggt 
aatggtgctactggtgattttgctggctctaattcccaaatggctcaagtcggtgacgg 
tgataattcacctttaatgaataatttccgtcaatatttaccttccctccctcaatcgg 
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ttgaatgtcgcccttttgtctttggcccaatacgcaaaccgcctctccccgcgcgttgg 
ccgattcattaatgcagctggcacgacaggtttcccgactggaaagcgggcagtgagcg 
caacgcaattaatgtgagttagctcactcattaggcaccccaggctttacactttatgc 
ttccggctcgtatgttgtgtggaattgtgagcggataacaatttcacacaggaaacagc 
5 tatcfaccatqattacqccaaqcttqcatqcctqcaq cccacagatggttagagaggctt 
acgcagcaggtctcatcaagacgatctacccgagcaataatctccagqaaatcaaatac 
cttcccaagaaggttaaagatgcagtcaaaagattcaggactaactgcatcaagaacac 
agagaaagatatatttctcaagatcagaagtactattccagtatggacgattcaaggct 
tgcttcacaaaccaaggcaagtaatagagattggagtctctaaaaaggtagttcccact 

10 gaatcaaaggccatggagtcaaagattcaaatagaggacctaacagaactcgccgtaaa 
gactggcgaacagttcatacagagtctcttacgactcaatgacaagaagaaaatcttcg 
tcaacatggtggagcacgacacacttgtctactccaaaaatatcaaagatacagtctca 
gaagaccaaagggcaattgagacttttcaacaaagggtaatatccggaaacctcctcgg 
attccattgcccagctatctgtcactttattgtgaagatagtggaaaaggaaggtggct 

15 cctacaaatgccatcattgcgataaaggaaaggccatcgttgaagatgcctctgccgac 
agtggtcccaaagatggacccccacccacgaggagcatcgtggaaaaagaagacgttcc 
aaccacgtcttcaaagcaagtggattgatgtgatatctccactgacgtaagggatgacg 
cacaatcccactatccttcgcaagacccttcctctatataaggaagttcatttcatttg 
gagagaacacgggg gactctagaccggt-bcgtccagcgcggccaatc-bt-bcaacaaggg 

20 gaggatgagaatagtgataagcagagtaccaaggatctgtattcatagctgacagattc 
tcctcctgtagtttcacaagaccaccgcgtaatccagatgcatatccaagct-tcttggc 
aaagccatcagcctgaaattcaaatgatcggctgactaggttcagaccaaagctgacca 
attgctgaagtgggattacagtatgctgaaatatgatgagcccaatgaggactggctgc 
gtatcaaacccaaagcttcgatacagatcagctgaatttcgcactagtgtatatcctcc 

25 aaattgtagaagtgtaagaatctgcatagcaacaaatgtgtacacagtatggttgagct 
tccagtgtcccaactcatgggcaataacagcaacaatttcctcatcgtctttgcactgt 
tgaattaatgtgtcataagggacaatcctcttgttcttgaagaatccatacatataggc 
attgctgtgacttgatcttgtggatccccATCTACCCGCTTCGCGTCGGCATCCGGTCA 
GTGGCAGTGAAGGGCGAACAGTTCCTGATTAACCACAAACCGTTCTACTTTACTGGCTT 

30 TGGTCGTCATGAAGATGCGGACTTGCGTGGCAAAGGATTCGATAACGTGCTGATGGTGC 
ACGACCACGCATTAATGGACTGGATTGGGGCCAACTCCTACCGTACCTCGCATTACCCT 
TACGCTGAAGAGATGCTCGACTGGGCAGATGAACATGGCATCGTGGTGATTGATGAAAC 
TGCTGCTGTCGGCTTTTCGCTCTCTTTAGGCATTGGTTTCGAAGCGGGCAACAAGCCGA 
AAGAACTGTACAGCGAAGAGGCAGTCAACGGGGAAACTCAGCAAGCGCACTTACAGGCG 
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ATTAAAGAGCTGATAGCGCGTGACAAAAACCACCCAAGCGTGGTGATGTGGAGTATTGC 
CAACGAACCGGATACCCGTCCGCAAGGTGCACGGGAATATTTCGCGCCACTGGCGGAAG 
CAACGCGTAAACTCGACCCGACGCGTCCGATCACCTGCGTCAATGTAATGTTCTGCGAC 
GCTCACACCGATACCATCAGCGATCTCTTTGATGTGCTGTGCCTGAACCGTTATTACGG 
ATGGTATGTCCAAAGCGGCGATTTGGAAACGGCAGAGAAGGTACTGGAAAAAGAACTTC 
TGGCCTGGCAGGAGAAACTGTACACCGACATGTGGAGTGAAGAGTATCAGTGTGCATGG 
CTGGATATGTATCACCGCGTCTTTGATCGCGTCAGCGCCGTCGTCGGTGAACAGGTATG 
GAATTTCGCCGATTTTGCGACCTCGCAAGGCATATTGCGCGTTGGCGGTAACAAGAAAG 
GGATCTTCACTCGCGACCGCAAACCGAAGTCGGCGGCTTTTCTGCTGCAAAAACGCTGG 
ACTGGCATGAACTTCGGTGAAAAACCGCAGCAGGGAGGCAAACAATGAatcaacaactc 
tcctqqcqcaccatcqtcqqctacaqcctcqqqaattqctaccqaqctc acaagatcaa 
g < bcacagcaatgcct:at:at:gi:a-bggai:tct-bcaagaacaagagga , bt:gtccc-bt:a-bgac 
acattaattcaacagtgcaaagacgatgaggaaattgttgctgttattgcccatgagtt 
gggacactggaagctcaaccatactgtgtacacat:t:t:gt < bgc , batgcaga < bt:ct:t:acac 
ttctacaatit'bggagga-ba-bacactag-bgcgaaati-bcagctga'bc'bgta'bcgaagc'b'bt: 
gggti:t:ga'bacgcagccagt:cct:cat:'bgggc'tcat:cat:at: < bt:cagcat:ac'bgt:aat:ccc 
act:tcagcaat:-bgg < bcagctt:-bggtc'bgaacct:agt:cagccgat:ca-bt:tgaat:t:t:cagg 
ctgatggc-bttgccaagaagcttggata-bgca'bctggat'bacgcggtggtcttgtgaaa 
c < bacaggaggagaat:ci:gtcagctatgaa < bacagat:cc-b1:gg'bact:ct:gc-bt:a < bcact:a 
t'bctca'bcc-bcccct'bg'b'bgaaaga-b-bggccgcgc-bggacgaaccgg gagctcgaattt 
ccccgatcgttcaaacatttggcaataaagtttcttaagattgaatcctgttgccggtc 
ttgcgatgattatcatataatttctgttgaattacgttaagcatgtaataattaacatg 
taatgcatgacgttatttatgagatgggtttttatgattagagtcccgcaattatacat 
ttaatacgcgatagaaaacaaaatatagcgcgcaaactaggataaattatcgcgcgcgg 
tgtcatctatgttactagatcgggaattcactggccgtcgttttacaacgtcgtgactg 
ggaaaaccctggcgttacccaacttaatcgccttgcagcacatccccctttcgccagct 
ggcgtaatagcgaagaggcccgcaccgatcgcccttcccaacagttgcgcagcctgaat 
ggcgcccgctcctttcgctttcttcccttcctttctcgccacgttcgccggctttcccc 
gtcaagctctaaatcgggggctccctttagggttccgatttagtgctttacggcacctc 
gaccccaaaaaacttgatttgggtgatggttcacgtagtgggccatcgccctgatagac 
ggtttttcgccctttgacgttggagtccacgttctttaatagtggactcttgttccaaa 
ctggaacaacactcaaccctatctcgggctattcttttgatttataagggattttgccg 
atttcggaaccaccatcaaacaggattttcgcctgctggggcaaaccagcgtggaccgc 
ttgctgcaactctctcagggccaggcggtgaagggcaatcagctgttgcccgtctcact 
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ggtgaaaagaaaaaccaccccagtacattaaaaacgtccgcaatgtgttattaagttgt 
ctaagcgtcaattt gttta caeca caatata tcctgcca 



SEQ ID NO:42 is the nucleic acid sequence of pBI121-HP-GmCPP. Italicized 
sequences are the right and left border repeats. Underlined sequence is the 35S promoter. 
Bold sequence is the antisense prenyl protease fragment of G. max. Bold and underlined 
sequence is the G. max sense prenyl protease fragment and sequence in upper case is the 
truncated GUS fragment. 

SEQ ID NO:43 

gtttacccgccaata ta tcctgtcaaacactgatagtttaaactgaaggcgggaaacga 
caatctgatcatgagcggagaattaagggagtcacgttatgacccccgccgatgacgcg 
ggacaagccgttttacgtttggaactgacagaaccgcaacgttgaaggagccactcagc 
cgcgggtttctggagtttaatgagctaagcacatacgtcagaaaccattattgcgcgtt 
caaaagtcgcctaaggtcactatcagctagcaaatatttcttgtcaaaaatgctccact 
gacgttccataaattcccctcggtatccaattagagtctcatattcactctcaatccaa 
ataatctgcaccggatctggatcgtttcgcatgattgaacaagatggattgcacgcagg 
ttctccggccgcttgggtggagaggctattcggctatgactgggcacaacagacaatcg 
gctgctctgatgccgccgtgttccggctgtcagcgcaggggcgcccggttctttttgtc 
aagaccgacctgtccggtgccctgaatgaactgcaggacgaggcagcgcggctatcgtg 
gctggccacgacgggcgttccttgcgcagctgtgctcgacgttgtcactgaagcgggaa 
gggactggctgctattgggcgaagtgccggggcaggatctcctgtcatctcaccttgct 
cctgccgagaaagtatccatcatggctgatgcaatgcggcggctgcatacgcttgatcc 
ggctacctgcccattcgaccaccaagcgaaacatcgcatcgagcgagcacgtactcgga 
tggaagccggtcttgtcgatcaggatgatctggacgaagagcatcaggggctcgcgcca 
gccgaactgttcgccaggctcaaggcgcgcatgcccgacggcgatgatctcgtcgtgac 
ccatggcgatgcctgcttgccgaatatcatggtggaaaatggccgcttttctggattca 
tcgactgtggccggctgggtgtggcggaccgctatcaggacatagcgttggctacccgt 
gatattgctgaagagcttggcggcgaatgggctgaccgcttcctcgtgctttacggtat 
cgccgctcccgattcgcagcgcatcgccttctatcgccttcttgacgagttcttctgag 
cgggactctggggttcgaaatgaccgaccaagcgacgcccaacctgccatcacgagatt 
tcgattccaccgccgccttctatgaaaggttgggcttcggaatcgttttccgggacgcc 
ggctggatgatcctccagcgcggggatctcatgctggagttcttcgcccacgggatctc 
tgcggaacaggcggtcgaaggtgccgatatcattacgacagcaacggccgacaagcaca 
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acgccacgatcctgagcgacaatatgatcgggcccggcgtccacatcaacggcgtcggc 
ggcgactgcccaggcaagaccgagatgcaccgcgatatcttgctgcgttcggatatttt 
cgtggagttcccgccacagacccggatgatccccgatcgttcaaacatttggcaataaa 
gtttcttaagattgaatcctgttgccggtcttgcgatgattatcatataatttctgttg 
5 aattacgttaagcatgtaataattaacatgtaatgcatgacgttatttatgagatgggt 
ttttatgattagagtcccgcaattatacatttaatacgcgatagaaaacaaaatatagc 
gcgcaaactaggataaattatcgcgcgcggtgtcatctatgttactagatcgggcctcc 
tgtcaatgctggcggcggctctggtggtggttctggtggcggctctgagggtggtggct 
ctgagggtggcggttctgagggtggcggctctgagggaggcggttccggtggtggctct 

10 ggttccggtgattttgattatgaaaagatggcaaacgctaataagggggctatgaccga 
aaatgccgatgaaaacgcgctacagtctgacgctaaaggcaaacttgattctgtcgcta 
ctgattacggtgctgctatcgatggtttcattggtgacgtttccggccttgctaatggt 
aatggtgctactggtgattttgctggctctaattcccaaatggctcaagtcggtgacgg 
tgataattcacctttaatgaataatttccgtcaatatttaccttccctccctcaatcgg 

15 ttgaatgtcgcccttttgtctttggcccaatacgcaaaccgcctctccccgcgcgttgg 
ccgattcattaatgcagctggcacgacaggtttcccgactggaaagcgggcagtgagcg 
caacgcaattaatgtgagttagctcactcattaggcaccccaggctttacactttatgc 
ttccggctcgtatgttgtgtggaattgtgagcggataacaatttcacacaggaaacagc 
tatqaccatgattacgccaagcttgcatgcctgcag cccacagatggttagagaggctt 

20 acgcagcaggtctcatcaagacgatctacccgagcaataatctccaggaaatcaaatac 
cttcccaagaaggttaaagatgcagtcaaaagattcaggactaactgcatcaagaacac 
aqagaaagatatatttctcaagatcagaagtactattccagtatggacgattcaaggct 
tgcttcacaaaccaaggcaagtaatagagattggagtctctaaaaaggtagttcccact 
gaatcaaaggccatggagtcaaagattcaaatagaggacctaacagaactcgccgtaaa 

25 gactggcgaacagttcatacagagtctcttacgactcaatgacaagaagaaaatcttcg 
tcaacatggtggagcacgacacacttgtctactccaaaaatatcaaagatacagtctca 
gaagaccaaagggcaattgagacttttcaacaaagggtaatatccggaaacctcctcgg 
attccattgcccagctatctgtcactttattgtgaagatagtggaaaaggaaggtggct 
cctacaaatgccatcattgcgataaaggaaaggccatcgttgaagatgcctctgccgac 

30 agtggtcccaaagatggacccccacccacgaggagcatcgtggaaaaagaagacgttcc 
aaccacgtcttcaaagcaagtggattgatgtgatatctccactgacgtaagggatqacg 
cacaatcccactatccttcgcaagacccttcctctatataaggaagttcatttcatttg 
gagagaacacgggg gactctagaggatccccgggt'bagtc-bfccc-b-bctta'bccgg-bticg 
tccagcgcggccaatctttcaacaaggggaggatgagaatagtgataagcagagtacca 
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aggatctgtattcatagctgacagattctcctcctgtagtttcacaagaccaccgcgta 
atccagatgcatatccaagcttcttggcaaagccatcagcctgaaattcaaatgatcgg 
ctgactaggttcagaccaaagctgaccaattgctgaagtgggattacagtatgctgaaa 
tatgatgagcccaatgaggactggctgcgtatcaaacccaaagcttcgatacagatcag 
ctgaatttcgcactagtgtatatcctccaaattgtagaagtgtaagaatctgcatagca 
acaaatgtgtacacagtatggttgagcttccagtgtcccaactcatgggcaataacagc 
aacaatttcctcatcgtctttgcactgttgaattaatgtgtcataagggacaatcctct 
tgttcttgaagaatccatacatataggcattgctgtgacttgatcttgtggatccatcg 
acaacaaatagtttctttaacggatagttgagggaggaagcaagttt:ctcgat*bttc*tc 
cctgagttgaccatctggaagtggagtgaacttattgaagagtggagctattagtactg 
gataaagggtcatcatcacaatagaaagaccaaacgtaaaaacccaaagatagatggcc 
aagtatggacctcctttctgtactattacaatgattgcagccacaataggtggaccaat 
tattacagaaaggaaaattcctttaagcatgtccctaaagaa-baaccatggtgtttgct 
tattaaaaccatgacgggcctcaatcacaaaagttgagtacagagaaaagggcaaatct 
gttatctgtgaccaaatcatcagccctgctaagaaggcaagggtatgcagtatttcatt 
ctcagcattgaaaccagctattgtcataaaatctcctgatttcttccaaaaccagggca 
ataccccaaagtacaaaattgtagagtctgtcactattgtcacaaactcgtgaacaaaa 
tggaagtggcttttatcaagactataggctctagatttctcaaatttctcttggctgat 
aacaccctctaaagtctttggaagagtaggaagtttgagggccctatgttgtcgcacat 
ccaagtaagtttcaaaaatgtacattaatatcataaatccgacaacggcttccatgtag 
ggaaacgccatgagctcgaatttccccgatcgttcaaacatttggcaataaagtttctt 
aagattgaatcctgttgccggtcttgcgatgattatcatataatttctgttgaattacg 
ttaagcatgtaataattaacatgtaatgcatgacgttatttatgagatgggtttttatg 
attagagtcccgcaattatacatttaatacgcgatagaaaacaaaatatagcgcgcaaa 
ctaggataaattatcgcgcgcggtgtcatctatgttactagatcgggaattcactggcc 
gtcgttttacaacgtcgtgactgggaaaaccctggcgttacccaacttaatcgccttgc 
agcacatccccctttcgccagctggcgtaatagcgaagaggcccgcaccgatcgccctt 
cccaacagttgcgcagcctgaatggcgcccgctcctttcgctttcttcccttcctttct 
cgccacgttcgccggctttccccgtcaagctctaaatcgggggctccctttagggttcc 
gatttagtgctttacggcacctcgaccccaaaaaacttgatttgggtgatggttcacgt 
agtgggccatcgccctgatagacggtttttcgccctttgacgttggagtccacgttctt 
taatagtggactcttgttccaaactggaacaacactcaaccctatctcgggctattctt 
ttgatttataagggattttgccgatttcggaaccaccatcaaacaggattttcgcctgc 
tggggcaaaccagcgtggaccgcttgctgcaactctctcagggccaggcggtgaagggc 
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aatcagctgttgcccgtctcactggtgaaaagaaaaaccaccccagtacattaaaaacg 
tccgcaatgtgttattaagttgtctaagcgtcaatttgtttacaccacaa ta tatcctg 
cca 



5 SEQ ID NO:43 is the nucleic acid sequence of pBI121-antisense-GmCPP. 

Italicized sequences are the right and left border repeats. Underlined sequence is the 35S 
promoter. Sequence in bold is the GmCPP anti-sense sequence. 



SEQ ID NO:44 

10 gtttacccgccaa tata tcctgtcaaacactgatagtttaaactgaaggcgggaaacga 
caatctgatcatgagcggagaattaagggagtcacgttatgacccccgccgatgacgcg 
ggacaagccgttttacgtttggaactgacagaaccgcaacgttgaaggagccactcagc 
cgcgggtttctggagtttaatgagctaagcacatacgtcagaaaccattattgcgcgtt 
caaaagtcgcctaaggtcactatcagctagcaaatatttcttgtcaaaaatgctccact 

15 gacgttccataaattcccctcggtatccaattagagtctcatattcactctcaatccaa 
ataatctgcaccggatctggatcgtttcgcatgattgaacaagatggattgcacgcagg 
ttctccggccgcttgggtggagaggctattcggctatgactgggcacaacagacaatcg 
gctgctctgatgccgccgtgttccggctgtcagcgcaggggcgcccggttctttttgtc 
aagaccgacctgtccggtgccctgaatgaactgcaggacgaggcagcgcggctatcgtg 

20 gctggccacgacgggcgttccttgcgcagctgtgctcgacgttgtcactgaagcgggaa 
gggactggctgctattgggcgaagtgccggggcaggatctcctgtcatctcaccttgct 
cctgccgagaaagtatccatcatggctgatgcaatgcggcggctgcatacgcttgatcc 
ggctacctgcccattcgaccaccaagcgaaacatcgcatcgagcgagcacgtactcgga 
tggaagccggtcttgtcgatcaggatgatctggacgaagagcatcaggggctcgcgcca 

25 gccgaactgttcgccaggctcaaggcgcgcatgcccgacggcgatgatctcgtcgtgac 
ccatggcgatgcctgcttgccgaatatcatggtggaaaatggccgcttttctggattca 
tcgactgtggccggctgggtgtggcggaccgctatcaggacatagcgttggctacccgt 
gatattgctgaagagcttggcggcgaatgggctgaccgcttcctcgtgctttacggtat 
cgccgctcccgattcgcagcgcatcgccttctatcgccttcttgacgagttcttctgag 

30 cgggactctggggttcgaaatgaccgaccaagcgacgcccaacctgccatcacgagatt 
tcgattccaccgccgccttctatgaaaggttgggcttcggaatcgttttccgggacgcc 
ggctggatgatcctccagcgcggggatctcatgctggagttcttcgcccacgggatctc 
tgcggaacaggcggtcgaaggtgccgatatcattacgacagcaacggccgacaagcaca 
acgccacgatcctgagcgacaatatgatcgggcccggcgtccacatcaacggcgtcggc 

35 ggcgactgcccaggcaagaccgagatgcaccgcgatatcttgctgcgttcggatatttt 
cgtggagttcccgccacagacccggatgatccccgatcgttcaaacatttggcaataaa 
gtttcttaagattgaatcctgttgccggtcttgcgatgattatcatataatttctgttg 
aattacgttaagcatgtaataattaacatgtaatgcatgacgttatttatgagatgggt 
ttttatgattagagtcccgcaattatacatttaatacgcgatagaaaacaaaatatagc 

40 gcgcaaactaggataaattatcgcgcgcggtgtcatctatgttactagatcgggcctcc 
tgtcaatgctggcggcggctctggtggtggttctggtggcggctctgagggtggtggct 
ctgagggtggcggttctgagggtggcggctctgagggaggcggttccggtggtggctct 
ggttccggtgattttgattatgaaaagatggcaaacgctaataagggggctatgaccga 
aaatgccgatgaaaacgcgctacagtctgacgctaaaggcaaacttgattctgtcgcta 

45 ctgattacggtgctgctatcgatggtttcattggtgacgtttccggccttgctaatggt 
aatggtgctactggtgattttgctggctctaattcccaaatggctcaagtcggtgacgg 
tgataattcacctttaatgaataatttccgtcaatatttaccttccctccctcaatcgg 
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ttgaatgtcgcccttttgtctttggcccaatacgcaaaccgcctctccccgcgcgttgg 
ccgattcattaatgcagctggcacgacaggtttcccgactggaaagcgggcagtgagcg 
caacgcaattaatgtgagttagctcactcattaggcaccccaggctttacactttatgc 
ttccggctcgtatgttgtgtggaattgtgagcggataacaatttcacacaggaaacagc 
tatqaccatqattacqccaagcttqcatqcctqcag ggagccatagatgcaattcaatc 
aaactgaaatttctgcaagaatctcaaacacggagatctcaaagtttgaaagaaaattt 
atttcttcgactcaaaacaaacttacgaaatttaggtagaacttatatacattatattg 
taattttttgtaacaaaatgtttttattattattatagaattttactggttaaattaaa 
aatgaatagaaaaggtgaattaagaggagaqaggaggtaaacattttcttctatttttt 
catattttcaggataaattattgtaaaagtttacaagatttccatttgactagtgtaaa 
tgaggaatattctctagtaagatcattatttcatctacttcttttatcttctaccagta 
gaggaataaacaatatttagctcctttgtaaatacaaattaattttccttcttgacatc 
attcaattttaattttacgtataaaataaaagatcatacctattagaacgattaaggag 
aaatacaattcgaatgagaaggatgtgccgtttgttataataaacagccacacgacgta 
aacgtaaaatgaccacatgatgggccaatagacatggaccgactactaataatagtaag 
ttacattttaggatggaataaatatcataccgacatcagttttgaaagaaaagggaaaa 
aaagaaaaaataaataaaagatatactaccgacatgagttccaaaaagcaaaaaaaaag 
atcaagccgacacagacacgcgtagagagcaaaatgactttgacgtcacaccacgaaaa 
cagacgcttcatacgtgtccctttatctctctcagtctctctataaacttagtgagacc 
ctcctctgttttactcacaaatatgcaaactagaaaacaatcatcaggaataaagggtt 
tgattacttctattggaaag gactctagaggatccccggga^bggcg-bttcccbaca-tgg 
aagccgttgtcggatttatgatattaatgtacatttttgaaacttacttggatgtgcga 
caacatagggccctcaaacttcctactcttccaaagactttagagggtgttatcagcca 
agagaaatttgagaaatctagagcctatagtcttgataaaagccacttccattttgttc 
acgagtttgtgacaatagtgacagactctacaattttgtactttggggtattgccctgg 
ttttggaagaaatcaggagattttatgacaatagctggtttcaatgctgagaatgaaat 
actgcatacccttgccttcttagcagggctgatgatttggtcacagataacagatttgc 
ccttttctctgtactcaacttttgtgattgaggcccgtcatggttttaataagcaaaca 
ccatggttattctttagggacatgcttaaaggaattttcctttctgtaataattggtcc 
acctattgtggc-tgcaatcattgtaatagtacagaaaggaggtccatacttggccatct 
atctttgggtttttacgtttggtctttctattgtgatgatgaccctttatccagtacta 
atagctccactcttcaataagttcactccacttccagatggtcaactcagggagaaaat 
cgagaaacttgcttcctccctcaactatccgttaaagaaactatttgttgtcgatggat 
ccacaagatcaagtcacagcaatgcctatatgtatggattcttcaagaacaagaggatt 
gtcccttatgacacattaattcaacagtgcaaagacgatgaggaaattgttgctgttat 
tgcccatgagttgggacactggaagctcaaccatactgtgtacacatttgttgctatgc 
agattcttacacttctacaatttggaggatatacactagtgcgaaattcagctgatctg 
tatcgaagctttgggtttgatacgcagccagtcctcattgggctcatcatatttcagca 
tactgtaatcccacttcagcaattggtcagctttggtctgaacctagtcagccgatcat 
ttgaatttcaggctgatggctttgccaagaagcttggatatgcatctggattacgcggt 
ggtcttgtgaaactacaggaggagaatctgtcagctatgaatacagatccttggtactc 
tgcttatcactattctcatcctccccttgttgaaagattggccgcgctggacgaaccgg 
ataagaaggaagactaagagctcgaatttccccgatcgttcaaacatttggcaataaag 
tttcttaagattgaatcctgttgccggtcttgcgatgattatcatataatttctgttga 
attacgttaagcatgtaataattaacatgtaatgcatgacgttatttatgagatgggtt 
tttatgattagagtcccgcaattatacatttaatacgcgatagaaaacaaaatatagcg 
cgcaaactaggataaattatcgcgcgcggtgtcatctatgttactagatcgggaattca 
ctggccgtcgttttacaacgtcgtgactgggaaaaccctggcgttacccaacttaatcg 
ccttgcagcacatccccctttcgccagctggcgtaatagcgaagaggcccgcaccgatc 
gcccttcccaacagttgcgcagcctgaatggcgcccgctcctttcgctttcttcccttc 
ctttctcgccacgttcgccggctttccccgtcaagctctaaatcgggggctccctttag 
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ggttccgatttagtgctttacggcacctcgaccccaaaaaacttgatttgggtgatggt 
tcacgtagtgggccatcgccctgatagacggtttttcgccctttgacgttggagtccac 
gttctttaatagtggactcttgttccaaactggaacaacactcaaccctatctcgggct 
attcttttgatttataagggattttgccgatttcggaaccaccatcaaacaggattttc 
5 gcctgctggggcaaaccagcgtggaccgcttgctgcaactctctcagggccaggcggtg 
aagggcaatcagctgttgcccgtctcactggtgaaaagaaaaaccaccccagtacatta 
aaaacgtccgcaatgtgttattaagttgtctaagcgtcaatttgtt ta caeca caa tat 
a tcctgcca 

10 SEQ ID NO:44 is the nucleic acid sequence of pRD29A-GmCPP. Italicized 

sequences are the right and left border repeats. Underlined sequence is the RD29A 
promoter. Sequence in bold is the GmCPP sense sequence. 



SEQ ID NO:45 

15 gt ttacccgcca a tata tcctgt caaacact gat agtttaaactgaaggcgggaaacga 
caatctgatcatgagcggagaattaagggagtcacgttatgacccccgccgatgacgcg 
ggacaagccgttttacgtttggaactgacagaaccgcaacgttgaaggagccactcagc 
cgcgggtttctggagtttaatgagctaagcacatacgtcagaaaccattattgcgcgtt 
caaaagtcgcctaaggtcactatcagctagcaaatatttcttgtcaaaaatgctccact 

20 gacgttccataaattcccctcggtatccaattagagtctcatattcactctcaatccaa 
ataatctgcaccggatctggatcgtttcgcatgattgaacaagatggattgcacgcagg 
ttctccggccgcttgggtggagaggctattcggctatgactgggcacaacagacaatcg 
gctgctctgatgccgccgtgttccggctgtcagcgcaggggcgcccggttctttttgtc 
aagaccgacctgtccggtgccctgaatgaactgcaggacgaggcagcgcggctatcgtg 

25 gctggccacgacgggcgttccttgcgcagctgtgctcgacgttgtcactgaagcgggaa 
gggactggctgctattgggcgaagtgccggggcaggatctcctgtcatctcaccttgct 
cctgccgagaaagtatccatcatggctgatgcaatgcggcggctgcatacgcttgatcc 
ggctacctgcccattcgaccaccaagcgaaacatcgcatcgagcgagcacgtactcgga 
tggaagccggtcttgtcgatcaggatgatctggacgaagagcatcaggggctcgcgcca 

30 gccgaactgttcgccaggctcaaggcgcgcatgcccgacggcgatgatctcgtcgtgac 
ccatggcgatgcctgcttgccgaatatcatggtggaaaatggccgcttttctggattca 
tcgactgtggccggctgggtgtggcggaccgctatcaggacatagcgttggctacccgt 
gatattgctgaagagcttggcggcgaatgggctgaccgcttcctcgtgctttacggtat 
cgccgctcccgattcgcagcgcatcgccttctatcgccttcttgacgagttcttctgag 

35 cgggactctggggttcgaaatgaccgaccaagcgacgcccaacctgccatcacgagatt 
tcgattccaccgccgccttctatgaaaggttgggcttcggaatcgttttccgggacgcc 
ggctggatgatcctccagcgcggggatctcatgctggagttcttcgcccacgggatctc 
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tgcggaacaggcggtcgaaggtgccgatatcattacgacagcaacggccgacaagcaca 
acgccacgatcctgagcgacaatatgatcgggcccggcgtccacatcaacggcgtcggc 
ggcgactgcccaggcaagaccgagatgcaccgcgatatcttgctgcgttcggatatttt 
cgtggagttcccgccacagacccggatgatccccgatcgttcaaacatttggcaataaa 
5 gtttcttaagattgaatcctgttgccggtcttgcgatgattatcatataatttctgttg 
aattacgttaagcatgtaataattaacatgtaatgcatgacgttatt tat gaga tgggt 
ttttatgattagagtcccgcaattatacatttaatacgcgatagaaaacaaaatatagc 
gcgcaaactaggataaattatcgcgcgcggtgtcatctatgttactagatcgggcctcc 
tgtcaatgctggcggcggctctggtggtggttctggtggcggctctgagggtggtggct 

10 ctgagggtggcggttctgagggtggcggctctgagggaggcggttccggtggtggctct 
ggttccggtgattttgattatgaaaagatggcaaacgctaataagggggctatgaccga 
aaatgccgatgaaaacgcgctacagtctgacgctaaaggcaaacttgattctgtcgcta 
ctgattacggtgctgctatcgatggtttcattggtgacgtttccggccttgctaatggt 
aatggtgctactggtgattttgctggctctaattcccaaatggctcaagtcggtgacgg 

15 tgataattcacctttaatgaataatttccgtcaatatttaccttccctccctcaatcgg 
ttgaatgtcgcccttttgtctttggcccaatacgcaaaccgcctctccccgcgcgttgg 
ccgattcattaatgcagctggcacgacaggtttcccgactggaaagcgggcagtgagcg 
caacgcaattaatgtgagttagctcactcattaggcaccccaggctttacactttatgc 
ttccggctcgtatgttgtgtggaattgtgagcggataacaatttcacacaggaaacagc 

20 tatgaccatgattacgccaagcttgcatgcctgcag ggagccatagatgcaattcaatc 
aaactgaaatttctgcaagaatctcaaacacggagatctcaaagtttgaaagaaaattt 
atttcttcgactcaaaacaaacttacgaaatttaggtagaacttatatacattatattg 
taattttttgtaacaaaatgtttttattattattatagaattttactggttaaattaaa 
aatgaatagaaaaggtgaattaagaggagagaggaggtaaacattttcttctatttttt 

25 catattttcaggataaattattgtaaaagtttacaagatttccatttgactagtgtaaa 
tgaggaatattctctagtaagatcattatttcatctacttcttttatcttctaccagta 
gaggaataaacaatatttagctcctttgtaaatacaaattaattttccttcttgacatc 
attcaattttaattttacgtataaaataaaagatcatacctattagaacgattaaggag 
aaatacaattcgaatgagaaggatgtgccgtttgttataataaacagccacacgacgta 

30 aacgtaaaatgaccacatgatgggccaatagacatggaccgactactaataatagtaag 
ttacattttaggatggaataaatatcataccgacatcagttttgaaagaaaagggaaaa 
aaagaaaaaataaataaaagatatactaccgacatgagttccaaaaagcaaaaaaaaag 
atcaagccgacacagacacgcgtagagagcaaaatgactttgacgtcacaccacgaaaa 
cagacgcttcatacgtgtccctttatctctctcagtctctctataaacttagtgagacc 
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ctcctctgttttactcacaaatatgcaaactagaaaacaatcatcaggaataaagggtt 
tgattacttctattggaaag gactctagaccgg-b-bcgtccagcgcggccaatctttcaa 
caaggggaggatgagaatagtgataagcagagtaccaaggatctgtattcatagctgac 
agattctcctcctgtagtttcacaagaccaccgcgtaatccagatgcatatccaagctt 
5 cttggcaaagccatcagcctgaaattcaaatgatcggctgactaggttcagaccaaagc 
tgaccaattgctgaagtgggattacagtatgctgaaatatga-tgagcccaatgaggact 
ggctgcgtatcaaacccaaagcttcgatacagatcagctgaatttcgcactagtgtata 
tcctccaaattgtagaagtgtaagaatctgcatagcaacaaatgtgtacacagtatggt 
tgagcttccagtgtcccaactcatgggcaataacagcaacaatttcctcatcgtctttg 

10 cactgttgaattaatgtgtcataagggacaatcctcttgttcttgaagaatccatacat 
ataggcattgctgtgacttgatcttgtggatccccATCTACCCGCTTCGCGTCGGCATC 
CGGTCAGTGGCAGTGAAGGGCGAACAGTTCCTGATTAACCACAAACCGTTCTACTTTAC 
TGGCTTTGGTCGTCATGAAGATGCGGACTTGCGTGGCAAAGGATTCGATAACGTGCTGA 
TGGTGCACGACCACGCATTAATGGACTGGATTGGGGCCAACTCCTACCGTACCTCGCAT 

15 TACCCTTACGCTGAAGAGATGCTCGACTGGGCAGATGAACATGGCATCGTGGTGATTGA 
TGAAACTGCTGCTGTCGGCTTTTCGCTCTCTTTAGGCATTGGTTTCGAAGCGGGCAACA 
AGCCGAAAGAACTGTACAGCGAAGAGGCAGTCAACGGGGAAACTCAGCAAGCGCACTTA 
CAGGCGATTAAAGAGCTGATAGCGCGTGACAAAAACCACCCAAGCGTGGTGATGTGGAG 
TATTGCCAACGAACCGGATACCCGTCCGCAAGGTGCACGGGAATATTTCGCGCCACTGG 

20 CGGAAGCAACGCGTAAACTCGACCCGACGCGTCCGATCACCTGCGTCAATGTAATGTTC 
TGCGACGCTCACACCGATACCATCAGCGATCTCTTTGATGTGCTGTGCCTGAACCGTTA 
TTACGGATGGTATGTCCAAAGCGGCGATTTGGAAACGGCAGAGAAGGTACTGGAAAAAG 
AACTTCTGGCCTGGCAGGAGAAACTGTACACCGACATGTGGAGTGAAGAGTATCAGTGT 
GCATGGCTGGATATGTATCACCGCGTCTTTGATCGCGTCAGCGCCGTCGTCGGTGAACA 

25 GGTATGGAATTTCGCCGATTTTGCGACCTCGCAAGGCATATTGCGCGTTGGCGGTAACA 
AGAAAGGGATCTTCACTCGCGACCGCAAACCGAAGTCGGCGGCTTTTCTGCTGCAAAAA 
CGCTGGACTGGCATGAACTTCGGTGAAAAACCGCAGCAGGGAGGCAAACAATGAatcaa 
caactctcctggcgcaccatcgtcggctacagcctcgggaattgctaccgagctcacaa 
gatcaagtcacagcaatgcctatatgtatggattcttcaagaacaagaggattgtccct 

30 tatgacacattaattcaacagtgcaaagacgatgaggaaattgttgctgttattgccca 
tgagttgggacactggaagctcaaccatactgtgtacacatttgttgctatgcagattc 
t'bacac'btc'bacaa'tt'bggagga-ba'bacac'bag'bgcgaaattcagc'bgatc'bg'ba-bcga 
agctttgggtttgatacgcagccagtcctcat-bgggctcatcatat-btcagca-bacbgt: 
aatcccacbtcagcaat: , bggtcagct < bt:gg-bct:gaacc , bagt:cagccga < bca-b^tgaat 
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ttcaggctgatggctttgccaagaagcttggatatg'caiictgg'a-b-bacgcgg'bggtc'b-b 
gtgaaactacaggaggagaatctgtcagctatgaatacagatccttggtactctgctta 
tcactattctcatcctccccttgttgaaagattggccgcgctggacgaaccgg gagctc 

gaatttccccgatcgttcaaacatttggcaataaagtttcttaagattgaatcctgttg 
ccggtcttgcgatgattatcatataatttctgttgaattacgttaagcatgtaataatt 
aacatgtaatgcatgacgttatttatgagatgggtttttatgattagagtcccgcaatt 
atacatttaatacgcgatagaaaacaaaatatagcgcgcaaactaggataaattatcgc 
gcgcggtgtcatctatgttactagatcgggaattcactggccgtcgttttacaacgtcg 
tgactgggaaaaccctggcgttacccaacttaatcgccttgcagcacatccccctttcg 
ccagctggcgtaatagcgaagaggcccgcaccgatcgcccttcccaacagttgcgcagc 
ctgaatggcgcccgctcctttcgctttcttcccttcctttctcgccacgttcgccggct 
ttccccgtcaagctctaaatcgggggctccctttagggttccgatttagtgctttacgg 
cacctcgaccccaaaaaacttgatttgggtgatggttcacgtagtgggccatcgccctg 
atagacggtttttcgccctttgacgttggagtccacgttctttaatagtggactcttgt 
tccaaactggaacaacactcaaccctatctcgggctattcttttgatttataagggatt 
ttgccgatttcggaaccaccatcaaacaggattttcgcctgctggggcaaaccagcgtg 
gaccgcttgctgcaactctctcagggccaggcggtgaagggcaatcagctgttgcccgt 
ctcactggtgaaaagaaaaaccaccccagtacattaaaaacgtccgcaatgtgttatta 
agttgtctaagcgtcaattt gttta caeca caa tatatcctgcca 

SEQ ID NO:45 is the nucleic acid sequence of pRD29A-HP-GmCPP. Italicized 
sequences are the right and left border repeats. Underlined sequence is the RD29A 
promoter. Sequence in bold is the GmCPP antisense sequence, bold and underlined 
sequence is the GmCPP sense sequence. 

SEQ ID NO:46 

^//raccc^ccaara/fl/cc^/caaacactgatagtttaaactgaaggcgggaaacgacaatctgatcatgagcggaga 

attaagggagtcacgttatgacccccgccgatgacgcgggacaagccgttttacgtttggaactgacagaaccgcaacg 

ttgaaggagccactcagccgcgggtttctggagtttaatgagctaagcacatacgtcagaaaccattattgcgcgttcaa 

aagtcgcctaaggtcactatcagctagcaaatatttcttgtcaaaaatgctccactgacgttccataaattcccctcggtat 

ccaattagagtctcatattcactctcaatccaaataatctgcaccggatctggatcgtttcgcatgattgaacaagatgga 

ttgcacgcaggttctccggccgcttgggtggagaggctattcggctatgactgggcacaacagacaatcggctgctctga 

tgccgccgtgttccggctgtcagcgcaggggcgcccggttctttttgtcaagaccgacctgtccggtgccctgaatgaact 

gcaggacgaggcagcgcggctatcgtggctggccacgacgggcgttccttgcgcagctgtgctcgacgttgtcactgaa 

gcgggaagggactggctgctattgggcgaagtgccggggcaggatctcctgtcatctcaccttgctcctgccgagaaag 

tatccatcatggctgatgcaatgcggcggctgcatacgcttgatccggctacctgcccattcgaccaccaagcgaaacat 

cgcatcgagcgagcacgtactcggatggaagccggtcttgtcgatcaggatgatctggacgaagagcatcaggggctc 

gcgccagccgaactgttcgccaggctcaaggcgcgcatgcccgacggcgatgatctcgtcgtgacccatggcgatgcct 
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gcttgccgaatatcatggtggaaaatggccgcttttctggattcatcgactgtggccggctgggtgtggcggaccgctatc 

aggacatagcgttggctacccgtgatattgctgaagagcttggcggcgaatgggctgaccgcttcctcgtgctttacggt 

atcgccgctcccgattcgcagcgcatcgccttctatcgccttcttgacgagttcttctgagcgggactctggggttcgaaat 

gaccgaccaagcgacgcccaacctgccatcacgagatttcgattccaccgccgccttctatgaaaggttgggcttcgga 

atcgttttccgggacgccggctggatgatcctccagcgcggggatctcatgctggagttcttcgcccacgggatctctgcg 

gaacaggcggtcgaaggtgccgatatcattacgacagcaacggccgacaagcacaacgccacgatcctgagcgaca 

atatgatcgggcccggcgtccacatcaacggcgtcggcggcgactgcccaggcaagaccgagatgcaccgcgatatct 

tgctgcgttcggatattttcgtggagttcccgccacagacccggatgatccccgatcgttcaaacatttggcaataaagttt 

cttaagattgaatcctgttgccggtcttgcgatgattatcatataatttctgttgaattacgttaagcatgtaataattaacat 

gtaatgcatgacgttatttatgagatgggtttttatgattagagtcccgcaattatacatttaatacgcgatagaaaacaa 

aatatagcgcgcaaactaggataaattatcgcgcgcggtgtcatctatgttactagatcgggcctcctgtcaatgctggc 

ggcggctctggtggtggttctggtggcggctctgagggtggtggctctgagggtggcggttctgagggtggcggctctga 

gggaggcggttccggtggtggctctggttccggtgattttgattatgaaaagatggcaaacgctaataagggggctatg 

accgaaaatgccgatgaaaacgcgctacagtctgacgctaaaggcaaacttgattctgtcgctactgattacggtgctg 

ctatcgatggtttcattggtgacgtttccggccttgctaatggtaatggtgctactggtgattttgctggctctaattcccaa 

atggctcaagtcggtgacggtgataattcacctttaatgaataatttccgtcaatatttaccttccctccctcaatcggttga 

atgtcgcccttttgtctttggcccaatacgcaaaccgcctctccccgcgcgttggccgattcattaatgcagctggcacga 

caggtttcccgactggaaagcgggcagtgagcgcaacgcaattaatgtgagttagctcactcattaggcaccccaggct 

ttacactttatgcttccggctcgtatgttgtgtggaattgtgagcggataacaatttcacacaggaaacagctatgaccat 

gattacgccaagcttgcatgcctgca gggagccatagatgcaattcaatcaaactgaaatttctgcaagaatctcaaac 

acggagatctcaaagtttgaaagaaaatttatttcttcgactcaaaacaaacttacgaaatttaggtagaacttatataca 

ttatattgtaattttttgtaacaaaatgtttttattattattatagaattttactggttaaattaaaaatgaatagaaaaggtg 

aattaagaggagagaggaggtaaacattttcttctattttttcatattttcaggataaattattgtaaaagtttacaagattt 

ccatttgactagtgtaaatgaggaatattctctagtaagatcattatttcatctacttcttttatcttctaccagtagaggaat 

aaacaatatttagctcctttgtaaatacaaattaattttccttcttgacatcattcaattttaattttacgtataaaataaaag 

atcatacctattagaacgattaaggagaaatacaattcgaatgagaaggatgtgccgtttgttataataaacagccaca 

cgacgtaaacgtaaaatgaccacatgatgggccaatagacatggaccgactactaataatagtaagttacattttagga 

tggaataaatatcataccgacatcagttttgaaagaaaagggaaaaaaagaaaaaataaataaaagatatactaccg 

acatgagttccaaaaagcaaaaaaaaagatcaagccgacacagacacgcgtagagagcaaaatgactttgacgtca 

caccacgaaaacagacgcttcatacgtgtccctttatctctctcagtctctctataaacttagtgagaccctcctctgtttta 

ctcacaaatatgcaaactagaaaacaatcatcaggaataaagggtttgattacttctattggaaag gactctagaggat 

ccccgggttagtcttccttcttatccggttcgtccagcgcggccaatctttcaacaaggggaggatgagaatagtgataagcaga 

gtaccaaggatctgtattcatagctgacagattctcctcctgtagtttcacaagaccaccgcgtaatccagatgcatatccaagctt 

cttggcaaagccatcagcctgaaattcaaatgatcggctgactaggttcagaccaaagctgaccaattgctgaagtgggattaca 

gtatgctgaaatatgatgagcccaatgaggactggctgcgtatcaaacccaaagcttcgatacagatcagctgaatttcgcacta 

gtgtatatcctccaaattgtagaagtgtaagaatctgcatagcaacaaatgtgtacacagtatggttgagcttccagtgtcccaact 

catgggcaataacagcaacaatttcctcatcgtctttgcactgttgaattaatgtgtcataagggacaatcctcttgttcttgaagaat 

ccatacatataggcattgctgtgacttgatcttgtggatccatcgacaacaaatagtttctttaacggatagttgagggaggaagca 

agtttctcgattttctccctgagttgaccatctggaagtggagtgaacttattgaagagtggagctattagtactggataaagggtca 

tcatcacaatagaaagaccaaacgtaaaaacccaaagatagatggccaagtatggacctcctttctgtactattacaatgattgca 

gccacaataggtggaccaattattacagaaaggaaaattcctttaagcatgtccctaaagaataaccatggtgtttgcttattaaaa 

ccatgacgggcctcaatcacaaaagttgagtacagagaaaagggcaaatctgttatctgtgaccaaatcatcagccctgctaag 

aaggcaagggtatgcagtatttcattctcagcattgaaaccagctattgtcataaaatctcctgatttcttccaaaaccagggcaata 

ccccaaagtacaaaattgtagagtctgtcactattgtcacaaactcgtgaacaaaatggaagtggcttttatcaagactataggctc 

tagatttctcaaatttctcttggctgataacaccctctaaagtctttggaagagtaggaagtttgagggccctatgttgtcgcacatcc 

aagtaagtttcaaaaatgtacattaatatcataaatccgacaacggcttccatgtagggaaacgccatgagctcgaatttccccg 

atcgttcaaacatttggcaataaagtttcttaagattgaatcctgttgccggtcttgcgatgattatcatataatttctgttga 

attacgttaagcatgtaataattaacatgtaatgcatgacgttatttatgagatgggtttttatgattagagtcccgcaatta 

tacatttaatacgcgatagaaaacaaaatatagcgcgcaaactaggataaattatcgcgcgcggtgtcatctatgttact 

agatcgggaattcactggccgtcgttttacaacgtcgtgactgggaaaaccctggcgttacccaacttaatcgccttgca 
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gcacatccccctttcgccagctggcgtaatagcgaagaggcccgcaccgatcgcccttcccaacagttgcgcagcctga 

atggcgcccgctcctttcgctttcttcccttcctttctcgccacgttcgccggctttccccgtcaagctctaaatcgggggctc 

cctttagggttccgatttagtgctttacggcacctcgaccccaaaaaacttgatttgggtgatggttcacgtagtgggccat 

cgccctgatagacggtttttcgccctttgacgttggagtccacgttctttaatagtggactcttgttccaaactggaacaac 

actcaaccctatctcgggctattcttttgatttataagggattttgccgatttcggaaccaccatcaaacaggattttcgcct 

gctggggcaaaccagcgtggaccgcttgctgcaactctctcagggccaggcggtgaagggcaatcagctgttgcccgtc 

tcactggtgaaaagaaaaaccaccccagtacattaaaaacgtccgcaatgtgttattaagttgtctaagcgtcaattt^ 

tacaccacaatatatcctgcca 

SEQ ID NO:46 is the nucleic acid sequence of pRD29A-antisense-GmCPP. 
Italicized sequences are the right and left border repeats. Underlined sequence is the 
RD29A promoter. Sequence in bold is the GmCPP antisense sequence. 

SEQ ID NO:47 

gtttacccgccaa ta ta tcctgt caaacact gat agtttaaactgaaggcgggaaacga 
caatctgatcatgagcggagaattaagggagtcacgttatgacccccgccgatgacgcg 
ggacaagccgttttacgtttggaactgacagaaccgcaacgttgaaggagccactcagc 
cgcgggtttctggagtttaatgagctaagcacatacgtcagaaaccattattgcgcgtt 
caaaagtcgcctaaggtcactatcagctagcaaatatttcttgtcaaaaatgctccact 
gacgttccataaattcccctcggtatccaattagagtctcatattcactctcaatccaa 
ataatctgcaccggatctggatcgtttcgcatgattgaacaagatggattgcacgcagg 
ttctccggccgcttgggtggagaggctattcggctatgactgggcacaacagacaatcg 
gctgctctgatgccgccgtgttccggctgtcagcgcaggggcgcccggttctttttgtc 
aagaccgacctgtccggtgccctgaatgaactgcaggacgaggcagcgcggctatcgtg 
gctggccacgacgggcgttccttgcgcagctgtgctcgacgttgtcactgaagcgggaa 
gggactggctgctattgggcgaagtgccggggcaggatctcctgtcatctcaccttgct 
cctgccgagaaagtatccatcatggctgatgcaatgcggcggctgcatacgcttgatcc 
ggctacctgcccattcgaccaccaagcgaaacatcgcatcgagcgagcacgtactcgga 
tggaagccggtcttgtcgatcaggatgatctggacgaagagcatcaggggctcgcgcca 
gccgaactgttcgccaggctcaaggcgcgcatgcccgacggcgatgatctcgtcgtgac 
ccatggcgatgcctgcttgccgaatatcatggtggaaaatggccgcttttctggattca 
tcgactgtggccggctgggtgtggcggaccgctatcaggacatagcgttggctacccgt 
gatattgctgaagagcttggcggcgaatgggctgaccgcttcctcgtgctttacggtat 
cgccgctcccgattcgcagcgcatcgccttctatcgccttcttgacgagttcttctgag 
cgggactctggggttcgaaatgaccgaccaagcgacgcccaacctgccatcacgagatt 
tcgattccaccgccgccttctatgaaaggttgggcttcggaatcgttttccgggacgcc 
ggctggatgatcctccagcgcggggatctcatgctggagttcttcgcccacgggatctc 
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tgcggaacaggcggtcgaaggtgccgatatcattacgacagcaacggccgacaagcaca 
acgccacgatcctgagcgacaatatgatcgggcccggcgtccacatcaacggcgtcggc 
ggcgactgcccaggcaagaccgagatgcaccgcgatatcttgctgcgttcggatatttt 
cgtggagttcccgccacagacccggatgatccccgatcgttcaaacatttggcaataaa 
5 gtttcttaagattgaatcctgttgccggtcttgcgatgattatcatataatttctgttg 
aattacgttaagcatgtaataattaacatgtaatgcatgacgttatttatgagatgggt 
ttttatgattagagtcccgcaattatacatttaatacgcgatagaaaacaaaatatagc 
gcgcaaactaggataaattatcgcgcgcggtgtcatctatgttactagatcgggcctcc 
tgtcaatgctggcggcggctctggtggtggttctggtggcggctctgagggtggtggct 

10 ctgagggtggcggttctgagggtggcggctctgagggaggcggttccggtggtggctct 
ggttccggtgattttgattatgaaaagatggcaaacgctaataagggggctatgaccga 
aaatgccgatgaaaacgcgctacagtctgacgctaaaggcaaacttgattctgtcgcta 
ctgattacggtgctgctatcgatggtttcattggtgacgtttccggccttgctaatggt 
aatggtgctactggtgattttgctggctctaattcccaaatggctcaagtcggtgacgg 

15 tgataattcacctttaatgaataatttccgtcaatatttaccttccctccctcaatcgg 
ttgaatgtcgcccttttgtctttggcccaatacgcaaaccgcctctccccgcgcgttgg 
ccgattcattaatgcagctggcacgacaggtttcccgactggaaagcgggcagtgagcg 
caacgcaattaatgtgagttagctcactcattaggcaccccaggctttacactttatgc 
ttccggctcgtatgttgtgtggaattgtgagcggataacaatttcacacaggaaacagc 

20 tatqaccatgattacgccaagcttgcatgcctgcag cccacagatggttagagaggctt 
acgcagcaggtctcatcaagacgatctacccgagcaataatctccaggaaatcaaatac 
cttcccaagaaggttaaagatgcagtcaaaagattcaggactaactgcatcaagaacac 
agagaaagatatatttctcaagatcagaagtactattccagtatggacgattcaaggct 
tgcttcacaaaccaaggcaagtaatagagattggagtctctaaaaaggtagttcccact 

25 gaatcaaaggccatggagtcaaagattcaaatagaggacctaacagaactcgccgtaaa 
gactggcgaacagttcatacagagtctcttacgactcaatgacaagaagaaaatcttcg 
tcaacatggtggagcacgacacacttgtctactccaaaaatatcaaagatacagtctca 
gaagaccaaagggcaattgagacttttcaacaaagggtaatatccggaaacctcctcgg 
attccattgcccagctatctgtcactttattgtgaagatagtggaaaaggaaggtggct 

30 cctacaaatgccatcattgcgataaaggaaaggccatcgttgaagatgcctctgccgac 
agtggtcccaaagatggacccccacccacgaggagcatcgtggaaaaagaagacgttcc 
aaccacgtcttcaaagcaagtggattgatgtgatatctccactgacgtaagggatgacg 
cacaatcccactatccttcgcaagacccttcctctatataaggaagttcatttcatttg 
gagagaacacgggg gactctagaggatccatggcgat'bcc'b'btca'bggaaaccgtcg'b-b 
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ggttttatgatagtgatgtacgtttttgagacgtatttggatctgaggcaacatactgc 
tctcaagcttcccactctcccaaagactttggttggagtcattagccaagagaagtttg 
agaaatctcgagcttacagtcttgacaaaagccattttcactttgttcatgagtttgtt 
actatacttatggactctgcgattctgttctttgggatcttgccttggttttggaagat 
atctggcggctttctaccaatggtgggactcgatccagagaatgaaatcctgcacactc 
tttcattcttggctggtcttatgacatggtcacagatcactgatttgccattttctttg 
tactcaactttcgtgatcgagtctcggcatgggttcaacaaacaaacaatatggatgtt 
cattagggacatgatcaaaggaatactcctctctgtcatacctgcccctcctatcgttg 
ccgcaattattgttatagttcagaaaggaggtccttacctcgccatctatctgtgggca 
ttcatgtttatcctgtctctagtgatgatgactatataccctgttttgattgcacctct 
tttcaacaagttcactcctcttcctgatggagacctccgggagaagattgagaaacttg 
cttcttctctaaagtttcctctgaagaagctgtttgttgtcgatggatctacaaggtca 
agccatagtaatgcttacatgtatggtttcttcaagaacaaaaggattgttctttatga 
cacattgattcagcagtgccagaatgagaatgaaattgtggcggttattgcacacgagc 
tgggacactggaagctgaatcacactacatactcgttcattgctgttcaaatccttgcc 
ttcttgcaatttggaggatacactcttgtcagaaactccactgatctcttcaggagttt 
tggttttgatacacaaccagttctcattggtttgatcatatttcagcacactgtaatac 
cacttcaacacctagtaagctttgacctcaaccttgttagtcgagcgtttgagtttcag 
gctgatgcttttgcagtgaatcttggttatgcaaaggatctacgtcctgccctagtgaa 
gctacaggaagagaacttatcagcgatgaacacagacccattgtactcagcttatcact 
actcacaccctcctcttgtagagaggcttcgagccattgatggagaagacaagaagaca 
gattaacccctcgaatttccccgatcgttcaaacatttggcaataaagtttcttaagat 
tgaatcctgttgccggtcttgcgatgattatcatataatttctgttgaattacgttaag 
catgtaataattaacatgtaatgcatgacgttatttatgagatgggtttttatgattag 
agtcccgcaattatacatttaatacgcgatagaaaacaaaatatagcgcgcaaactagg 
ataaattatcgcgcgcggtgtcatctatgttactagatcgggaattcactggccgtcgt 
tttacaacgtcgtgactgggaaaaccctggcgttacccaacttaatcgccttgcagcac 
atccccctttcgccagctggcgtaatagcgaagaggcccgcaccgatcgcccttcccaa 
cagttgcgcagcctgaatggcgcccgctcctttcgctttcttcccttcctttctcgcca 
cgttcgccggctttccccgtcaagctctaaatcgggggctccctttagggttccgattt 
agtgctttacggcacctcgaccccaaaaaacttgatttgggtgatggttcacgtagtgg 
gccatcgccctgatagacggtttttcgccctttgacgttggagtccacgttctttaata 
gtggactcttgttccaaactggaacaacactcaaccctatctcgggctattcttttgat 
ttataagggattttgccgatttcggaaccaccatcaaacaggattttcgcctgctgggg 
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caaaccagcgtggaccgcttgctgcaactctctcagggccaggcggtgaagggcaatca 
gctgttgcccgtctcactggtgaaaagaaaaaccaccccagtacattaaaaacgtccgc 
aatgtgttattaagttgtctaagcgtcaatttgtttacaccacaa tatatcctgcca 

5 SEQ ID NO:47 is the nucleic acid sequence of pBI 1 2 1 -BnCPP. Italicized 

sequences are the right and left border repeats. Underlined sequence is the 35S promoter. 
Sequence in bold is the BnCPP antisense sequence. 

SEQ ID NO:48 

10 gtttacccgccaata tat cctgt caaacactgatagtttaaactgaaggcgggaaacga 
caatctgatcatgagcggagaattaagggagtcacgttatgacccccgccgatgacgcg 
ggacaagccgttttacgtttggaactgacagaaccgcaacgttgaaggagccactcagc 
cgcgggtttctggagtttaatgagctaagcacatacgtcagaaaccattattgcgcgtt 
caaaagtcgcctaaggtcactatcagctagcaaatatttcttgtcaaaaatgctccact 

15 gacgttccataaattcccctcggtatccaattagagtctcatattcactctcaatccaa 
ataatctgcaccggatctggatcgtttcgcatgattgaacaagatggattgcacgcagg 
ttctccggccgcttgggtggagaggctattcggctatgactgggcacaacagacaatcg 
gctgctctgatgccgccgtgttccggctgtcagcgcaggggcgcccggttctttttgtc 
aagaccgacctgtccggtgccctgaatgaactgcaggacgaggcagcgcggctatcgtg 

20 gctggccacgacgggcgttccttgcgcagctgtgctcgacgttgtcactgaagcgggaa 
gggact ggct get at tgggcgaagtgccggggcaggatct cctgt cat ctcacctt get 
cctgccgagaaagtatccatcatggctgatgcaatgcggcggctgcatacgcttgatcc 
ggct acctgcccattcgaccaccaagcgaaacatcgcatcgagcgagcacgt act egga 
tggaagccggtcttgtcgatcaggatgatctggacgaagagcatcaggggctcgcgcca 

25 gccgaactgttcgccaggctcaaggcgcgcatgcccgacggcgatgatctcgtcgtgac 
ccatggcgatgcctgcttgccgaatatcatggtggaaaatggccgcttttctggattca 
tcgactgtggccggctgggtgtggcggaccgctatcaggacatagcgttggctacccgt 
gatattgctgaagagcttggcggcgaatgggctgaccgcttcctcgtgctttacggtat 
cgccgctcccgattcgcagcgcatcgccttctatcgccttcttgacgagttcttctgag 

30 cgggactctggggttcgaaatgaccgaccaagcgacgcccaacctgccatcacgagatt 
tcgattccaccgccgccttctatgaaaggttgggcttcggaatcgttttccgggacgcc 
ggct ggat gat cctccagcgcggggatct cat get ggagttcttcgcccacgggat etc 
tgcggaacaggcggtcgaaggtgccgatatcattacgacagcaacggccgacaagcaca 
acgccacgatcctgagcgacaatatgatcgggcccggcgtccacatcaacggcgtcggc 
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ggcgactgcccaggcaagaccgagatgcaccgcgatatcttgctgcgttcggatatttt 

cgtggagttcccgccacagacccggatgatccccgatcgttcaaacatttggcaataaa 

gtttcttaagattgaatcctgttgccggtcttgcgatgattatcatataatttctgttg 

aattacgttaagcatgtaataattaacatgtaatgcatgacgttatttatgagatgggt 

ttttatgattagagtcccgcaattatacatttaatacgcgatagaaaacaaaatatagc 

gcgcaaactaggataaattatcgcgcgcggtgtcatctatgttactagatcgggcctcc 

tgtcaatgctggcggcggctctggtggtggttctggtggcggctctgagggtggtggct 

ctgagggtggcggttctgagggtggcggctctgagggaggcggttccggtggtggctct 

ggttccggtgattttgattatgaaaagatggcaaacgctaataagggggctatgaccga 

aaatgccgatgaaaacgcgctacagtctgacgctaaaggcaaacttgattctgtcgcta 

ctgattacggtgctgctatcgatggtttcattggtgacgtttccggccttgctaatggt 

aatggtgctactggtgattttgctggctctaattcccaaatggctcaagtcggtgacgg 

tgataattcacctttaatgaataatttccgtcaatatttaccttccctccctcaatcgg 

ttgaatgtcgcccttttgtctttggcccaatacgcaaaccgcctctccccgcgcgttgg 

ccgattcattaatgcagctggcacgacaggtttcccgactggaaagcgggcagtgagcg 

caacgcaattaatgtgagttagctcactcattaggcaccccaggctttacactttatgc 

ttccggctcgtatgttgtgtggaattgtgagcggataacaatttcacacaggaaacagc 

tatqaccatqattacqccaaqcttqcatgcctgcag cccacagatggttagagaggctt 

acgcagcaggtctcatcaagacgatctacccgagcaataatctccaggaaatcaaatac 

cttcccaagaaggttaaagatgcagtcaaaagattcaggactaactgcatcaagaacac 

agagaaagatatatttctcaagatcagaagtactattccagtatggacgattcaaggct 

tgcttcacaaaccaaggcaagtaatagagattggagtctctaaaaaggtagttcccact 

gaatcaaaggccatggagtcaaagattcaaatagaggacctaacagaactcgccgtaaa 

gactggcgaacagttcatacagagtctcttacgactcaatgacaagaagaaaatcttcg 

tcaacatggtggagcacgacacacttgtctactccaaaaatatcaaagatacagtctca 

gaagaccaaagggcaattgagacttttcaacaaagggtaatatccggaaacctcctcgg 

attccattgcccagctatctgtcactttattgtgaagatagtggaaaaggaaggtggct 

cctacaaatgccatcattgcgataaaggaaaggccatcgttgaagatgcctctgccgac 

agtggtcccaaagatggacccccacccacgaggagcatcgtggaaaaagaagacgttcc 

aaccacgtcttcaaagcaagtggattgatgtgatatctccactgacgtaagggatgacg 

cacaatcccactatccttcgcaagacccttcctctatataaggaagttcatttcatttg 

gagagaacacgggg gactctagaccagtg-bcccagc-bcg-bg-bgcaataaccgccacaat 

ttcattctcattctggcactgctgaatcaatgtgtcataaagaacaatccttttgttct 

tgaagaaaccatacatgtaagcattactatggcttgaccttgtagatccatcgacaaca 



122 



WO 03/012116 PCT/IB02/03887 

aacagcttcttcagaggaaactttagagaagaagcaagtttctcaatcttctcccggag 
gtctccatcaggaagaggagtgaacttgttgaaaagaggtgcaatcaaaacagggtata 
tagtcatcatcactagagacaggataaacatgaatgcccacagatagatggcgaggtaa 
ggacctcctttctgaactataacaataattgcggcaacgataggaggggcaggtatgac 
5 agagaggagtattcctttgatcatgtccctaatgaacatccatattgtttgtttgttga 
acccatgccgagactcgatcacgaaagttgagtacaaagaaaatggcaaatcagtgatc 
tgtgaccatgtcataagaccagccaagaatgaaagagtgtgcaggatttcattctctgg 
atcgagtcccaccattggtagaaggatccccATCTACCCGCTTCGCGTCGGCATCCGGT 
CAGTGGCAGTGAAGGGCGAACAGTTCCTGATTAACCACAAACCGTTCTACTTTACTGGC 

10 TTTGGTCGTCATGAAGATGCGGACTTGCGTGGCAAAGGATTCGATAACGTGCTGATGGT 
GCACGACCACGCATTAATGGACTGGATTGGGGCCAACTCCTACCGTACCTCGCATTACC 
CTTACGCTGAAGAGATGCTCGACTGGGCAGATGAACATGGCATCGTGGTGATTGATGAA 
ACTGCTGCTGTCGGCTTTTCGCTCTCTTTAGGCATTGGTTTCGAAGCGGGCAACAAGCC 
GAAAGAACTGTACAGCGAAGAGGCAGTCAACGGGGAAACTCAGCAAGCGCACTTACAGG 

15 CGATTTVAAGAGCTGATAGCGCGTGACAAAAACCACCCAAGCGTGGTGATGTGGAGTATT 
GCCAACGAACCGGATACCCGTCCGCAAGGTGCACGGGAATATTTCGCGCCACTGGCGGA 
AGCAACGCGTAAACTCGACCCGACGCGTCCGATCACCTGCGTCAATGTAATGTTCTGCG 
ACGCTCACACCGATACCATCAGCGATCTCTTTGATGTGCTGTGCCTGAACCGTTATTAC 
GGATGGTATGTCCAAAGCGGCGATTTGGAAACGGCAGAGAAGGTACTGGAAAAAGAACT 

20 TCTGGCCTGGCAGGAGAAACTGTACACCGACATGTGGAGTGAAGAGTATCAGTGTGCAT 
GGCTGGATATGTATCACCGCGTCTTTGATCGCGTCAGCGCCGTCGTCGGTGAACAGGTA 
TGGAATTTCGCCGATTTTGCGACCTCGCAAGGCATATTGCGCGTTGGCGGTAACAAGAA 
AGGGATCTTCACTCGCGACCGCAAACCGAAGTCGGCGGCTTTTCTGCTGCAAAAACGCT 
GGACTGGCATGAACTTCGGTGAAAAACCGCAGCAGGGAGGCAAACAATGAatcaacaac 

25 tctcctggcgcaccatcgtcggctacagcctcgggaattgctaccgagctcttctacca 
a-bggtgggac^cgatccagagaatgaaa-bcctgcacactc-b-btcati-bcttggctggtci: 
i:at:gacatggtcacagat:cact:ga-tt:tgccat:i:t:tct:t.tgtac , bcaact-b , bcgtga-bcg 
agtctcggcatgggttcaacaaacaaacaatatggatgttcattagggacatgatcaaa 
ggaatac'bcctictictg'bcatacctgcccctccliatcg-ttgccgcaa-btattgtta-bagt: 

30 •bcagaaaggaggtccttacct:cgccat:ct:atct:gtgggca'bt:cat:gt-b-bat:cct:gt:ctc 
t:agt:gatgatgact:at:at:accct:gt:t:t:t:gat:-bgcacc'bct < b-btcaacaagtt:cac-bcc-b 
ct:t:cct:ga , bggagacct:ccgggagaaga , bt:gagaaact,tgct:t:c 4 b < bct:ct:aaagtt:'bcc 
•bctgaagaagct.gtttgt:'bgt:cga'bggatctacaagg'bcaagccat:agtaat:gct-taca 
tg-batggtttct;t:caagaacaaaaggat:tgt:tctt:t:a^gacaca^t:ga , btcagcag-bgc 
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cagaatgagaatgaaattgtggcggttattgcacacgagctgggacactgg qaqctcqa 

atttccccgatcgttcaaacatttggcaataaagtttcttaagattgaatcctgttgcc 
ggtcttgcgatgattatcatataatttctgttgaattacgttaagcatgtaataattaa 
catgtaatgcatgacgttatttatgagatgggtttttatgattagagtcccgcaattat 
acatttaatacgcgatagaaaacaaaatatagcgcgcaaactaggataaattatcgcgc 
gcggtgtcatctatgttactagatcgggaattcactggccgtcgttttacaacgtcgtg 
actgggaaaaccctggcgttacccaacttaatcgccttgcagcacatccccctttcgcc 
agctggcgtaatagcgaagaggcccgcaccgatcgcccttcccaacagttgcgcagcct 
gaatggcgcccgctcctttcgctttcttcccttcctttctcgccacgttcgccggcttt 
ccccgtcaagctctaaatcgggggctccctttagggttccgatttagtgctttacggca 
cctcgaccccaaaaaacttgatttgggtgatggttcacgtagtgggccatcgccctgat 
agacggtttttcgccctttgacgttggagtccacgttctttaatagtggactcttgttc 
caaactggaacaacactcaaccctatctcgggctattcttttgatttataagggatttt 
gccgatttcggaaccaccatcaaacaggattttcgcctgctggggcaaaccagcgtgga 
ccgcttgctgcaactctctcagggccaggcggtgaagggcaatcagctgttgcccgtct 
cactggtgaaaagaaaaaccaccccagtacattaaaaacgtccgcaatgtgttattaag 
ttgtctaagcgtcaatttgtt ta caeca ca a tata tcctgcca 

SEQ ID NO:48 is the nucleic acid sequence of pBI121-HP-BnCPP. Italicized 
sequences are the right and left border repeats. Underlined sequence is the 35S promoter. 
Sequence in bold is the BnCPP antisense sequence, bold and underlined sequence is the 
BnCPP sense fragment and upper case indicates the truncated GUS fragment. 

SEQ ID NO:49 

gt t ta cccgcca a tata tec tgtcaaacact gat agtttaaact gaaggegggaaaega 
caatctgatcatgagcggagaattaagggagtcacgttatgacccccgccgatgacgcg 
ggacaagccgttttacgtttggaactgacagaaccgcaacgttgaaggagccactcagc 
cgcgggtttctggagtttaatgagctaagcacatacgtcagaaaccattattgcgcgtt 
caaaagtcgcctaaggtcactatcagctagcaaatatttcttgtcaaaaatgctccact 
gacgttccataaattcccctcggtatccaattagagtctcatattcactctcaatccaa 
ataatctgcaccggatctggatcgtttcgcatgattgaacaagatggattgcacgcagg 
ttctccggccgcttgggtggagaggctattcggctatgactgggcacaacagacaatcg 
gctgctctgatgccgccgtgttccggctgtcagcgcaggggcgcccggttctttttgtc 
aagaccgacctgtccggtgccctgaatgaactgcaggacgaggcagcgcggctatcgtg 
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gctggccacgacgggcgttccttgcgcagctgtgctcgacgttgtcactgaagcgggaa 
gggactggctgctattgggcgaagtgccggggcaggatctcctgtcatctcaccttgct 
cctgccgagaaagtatccatcatggctgatgcaatgcggcggctgcatacgcttgatcc 
ggctacctgcccattcgaccaccaagcgaaacatcgcatcgagcgagcacgtactcgga 
tggaagccggtcttgtcgatcaggatgatctggacgaagagcatcaggggctcgcgcca 
gccgaactgttcgccaggctcaaggcgcgcatgcccgacggcgatgatctcgtcgtgac 
ccatggcgatgcctgcttgccgaatatcatggtggaaaatggccgcttttctggattca 
tcgactgtggccggctgggtgtggcggaccgctatcaggacatagcgttggctacccgt 
gatattgctgaagagcttggcggcgaatgggctgaccgcttcctcgtgctttacggtat 
cgccgctcccgattcgcagcgcatcgccttctatcgccttcttgacgagttcttctgag 
cgggactctggggttcgaaatgaccgaccaagcgacgcccaacctgccatcacgagatt 
tcgattccaccgccgccttctatgaaaggttgggcttcggaatcgttttccgggacgcc 
ggctggatgatcctccagcgcggggatctcatgctggagttcttcgcccacgggatctc 
tgcggaacaggcggtcgaaggtgccgatatcattacgacagcaacggccgacaagcaca 
acgccacgatcctgagcgacaatatgatcgggcccggcgtccacatcaacggcgtcggc 
ggcgactgcccaggcaagaccgagatgcaccgcgatatcttgctgcgttcggatatttt 
cgtggagttcccgccacagacccggatgatccccgatcgttcaaacatttggcaataaa 
gtttcttaagattgaatcctgttgccggtcttgcgatgattatcatataatttctgttg 
aattacgttaagcatgtaataattaacatgtaatgcatgacgttatttatgagatgggt 
ttttatgattagagtcccgcaattatacatttaatacgcgatagaaaacaaaatatagc 
gcgcaaactaggataaattatcgcgcgcggtgtcatctatgttactagatcgggcctcc 
tgtcaatgctggcggcggctctggtggtggttctggtggcggctctgagggtggtggct 
ctgagggtggcggttctgagggtggcggctctgagggaggcggttccggtggtggctct 
ggttccggtgattttgattatgaaaagatggcaaacgctaataagggggctatgaccga 
aaatgccgatgaaaacgcgctacagtctgacgctaaaggcaaacttgattctgtcgcta 
ctgattacggtgctgctatcgatggtttcattggtgacgtttccggccttgctaatggt 
aatggtgctactggtgattttgctggctctaattcccaaatggctcaagtcggtgacgg 
tgataattcacctttaatgaataatttccgtcaatatttaccttccctccctcaatcgg 
ttgaatgtcgcccttttgtctttggcccaatacgcaaaccgcctctccccgcgcgttgg 
ccgattcattaatgcagctggcacgacaggtttcccgactggaaagcgggcagtgagcg 
caacgcaattaatgtgagttagctcactcattaggcaccccaggctttacactttatgc 
ttccggctcgtatgttgtgtggaattgtgagcggataacaatttcacacaggaaacagc 
tatqaccatqattacgccaaqcttgcatQCCtqcag cccacagatggttagagaggctt 
acgcagcaggtctcatcaagacgatctacccgaqcaataatctccaggaaatcaaatac 
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cttcccaagaaggttaaagatgcagtcaaaagattcaggactaactgcatcaagaacac 
agagaaagatatatttctcaagatcagaagtactattccagtatggacgattcaaggct 
tgcttcacaaaccaaggcaagtaatagagattggagtctctaaaaaggtagttcccact 
gaatcaaaggccatggagtcaaagattcaaatagaggacctaacagaactcgccgtaaa 
gactggcgaacagttcatacagagtctcttacgactcaatgacaagaagaaaatcttcg 
tcaacatggtggagcacgacacacttgtctactccaaaaatatcaaagatacagtctca 
gaagaccaaagggcaattgagacttttcaacaaagggtaatatccggaaacctcctcgg 
attccattgcccagctatctgtcactttattgtgaagatagtggaaaaggaaggtggct 
cctacaaatgccatcattgcgataaaggaaaggccatcgttgaagatgcctctgccgac 
agtggtcccaaagatggacccccacccacgaggagcatcgtggaaaaagaagacgttcc 
aaccacgtcttcaaagcaagtggattgatgtgatatctccactgacgtaagggatgacg 
cacaatcccactatccttcgcaagacccttcctctatataaggaagttcatttcatttg 
gagagaacacgggg gactctagaggatcct:taatct:g , bct: , bct:'bgt:ct:tct:cca-bcaat: 
ggctcgaagcctctctacaagaggagggtgtgagtagtgataagctgagtacaatgggt 
ctgtgttcatcgctgataagttctcttcctgtagcttcactagggcaggacgtagatcc 
tttgcataaccaagattcactgcaaaagcatcagcctgaaactcaaacgctcgactaac 
aaggttgaggtcaaagcttactaggtgttgaagtggtattacagtgtgctgaaatatga 
tcaaaccaatgagaactggttgtgtatcaaaaccaaaactcctgaagagatcagtggag 
tttctgacaagagtgtatcctccaaattgcaagaaggcaaggatttgaacagcaatgaa 
cgagtatgtagtgtgattcagcttccagtgtcccagctcgtgtgcaataaccgccacaa 
tttcattctcattctggcactgctgaatcaatgtgtcataaagaacaatccttttgttc 
ttgaagaaaccatacatgtaagcattactatggcttgaccttgtagatccatcgacaac 
aaacagcttcttcagaggaaactttagagaagaagcaagtttctcaatcttctcccgga 
ggtctccatcaggaagaggagtgaacttgttgaaaagaggtgcaatcaaaacagggtat 
atagtcatcatcactagagacaggataaacatgaatgcccacagatagatggcgaggta 
aggacctcctttctgaactataacaataattgcggcaacgataggaggggcaggtatga 
cagagaggag-tattcctttgatcatgtccctaatgaacatccatattgtttgtttgttg 
aacccatgccgagactcgatcacgaaagttgagtacaaagaaaatggcaaatcagtgat 
ctgtgaccatgtcataagaccagccaagaatgaaagagtgtgcaggatttcattctctg 
gatcgagtcccaccattggtagaaagccgccagatatcttccaaaaccaaggcaagatc 
ccaaagaacagaatcgcagagtccataagtatagtaacaaactcatgaacaaagtgaaa 
atggcttttgtcaagactgtaagctcgagatttctcaaacttctcttggctaatgactc 
caaccaaagtctttgggagagtgggaagcttgagagcagtatgttgcctcagatccaaa 
tacgtctcaaaaacgtacatcactatcataaaaccaacgacggtttccatgaaaggaat 
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cgccatcccctcgaatttccccgatcgttcaaacatttggcaataaagtttcttaagat 
tgaatcctgttgccggtcttgcgatgattatcatataatttctgttgaattacgttaag 
catgtaataattaacatgtaatgcatgacgttatttatgagatgggtttttatgattag 
agtcccgcaattatacatttaatacgcgatagaaaacaaaatatagcgcgcaaactagg 
ataaattatcgcgcgcggtgtcatctatgttactagatcgggaattcactggccgtcgt 
tttacaacgtcgtgactgggaaaaccctggcgttacccaacttaatcgccttgcagcac 
atccccctttcgccagctggcgtaatagcgaagaggcccgcaccgatcgcccttcccaa 
cagttgcgcagcctgaatggcgcccgctcctttcgctttcttcccttcctttctcgcca 
cgttcgccggctttccccgtcaagctctaaatcgggggctccctttagggttccgattt 
agtgctttacggcacctcgaccccaaaaaacttgatttgggtgatggttcacgtagtgg 
gccatcgccctgatagacggtttttcgccctttgacgttggagtccacgttctttaata 
gtggactcttgttccaaactggaacaacactcaaccctatctcgggctattcttttgat 
ttataagggattttgccgatttcggaaccaccatcaaacaggattttcgcctgctgggg 
caaaccagcgtggaccgcttgctgcaactctctcagggccaggcggtgaagggcaatca 
gctgttgcccgtctcactggtgaaaagaaaaaccaccccagtacattaaaaacgtccgc 
aatgtgttattaagttgtctaagcgtcaattt gt t ta caeca caata tat cctgcca 

SEQ ID NO:49 is the nucleic acid sequence of pBI121-antisense-BnCPP. 
Italicized sequences are the right and left border repeats. Underlined sequence is the 35 S 
promoter. Sequence in bold is the BnCPP antisense sequence. 

SEQ ID NO:50 

gt tta cccgcca a tata tcctgt caaacact gat agtttaaactgaaggcgggaaacga 
caatctgatcatgagcggagaattaagggagtcacgttatgacccccgccgatgacgcg 
ggacaagccgttttacgtttggaactgacagaaccgcaacgttgaaggagccactcagc 
cgcgggtttctggagtttaatgagctaagcacatacgtcagaaaccattattgcgcgtt 
caaaagtcgcctaaggtcactatcagctagcaaatatttcttgtcaaaaatgctccact 
gacgttccataaattcccctcggtatccaattagagtctcatattcactctcaatccaa 
ataatctgcaccggatctggatcgtttcgcatgattgaacaagatggattgcacgcagg 
ttctccggccgcttgggtggagaggctattcggctatgactgggcacaacagacaatcg 
gctgctctgatgccgccgtgttccggctgtcagcgcaggggcgcccggttctttttgtc 
aagaccgacctgtccggtgccctgaatgaactgcaggacgaggcagcgcggctatcgtg 
gctggccacgacgggcgttccttgcgcagctgtgctcgacgttgtcactgaagcgggaa 
gggactggctgctattgggcgaagtgccggggcaggatctcctgtcatctcaccttgct 
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cctgccgagaaagtatccatcatggctgatgcaatgcggcggctgcatacgcttgatcc 
ggctacctgcccattcgaccaccaagcgaaacatcgcatcgagcgagcacgtactcgga 
tggaagccggtcttgtcgatcaggatgatctggacgaagagcatcaggggctcgcgcca 
gccgaactgttcgccaggctcaaggcgcgcatgcccgacggcgatgatctcgtcgtgac 
ccatggcgatgcctgcttgccgaatatcatggtggaaaatggccgcttttctggattca 
tcgactgtggccggctgggtgtggcggaccgctatcaggacatagcgttggctacccgt 
gatattgctgaagagcttggcggcgaatgggctgaccgcttcctcgtgctttacggtat 
cgccgctcccgattcgcagcgcatcgccttctatcgccttcttgacgagttcttctgag 
cgggactctggggttcgaaatgaccgaccaagcgacgcccaacctgccatcacgagatt 
tcgattccaccgccgccttctatgaaaggttgggcttcggaatcgttttccgggacgcc 
ggctggatgatcctccagcgcggggatctcatgctggagttcttcgcccacgggatctc 
tgcggaacaggcggtcgaaggtgccgatatcattacgacagcaacggccgacaagcaca 
acgccacgatcctgagcgacaatatgatcgggcccggcgtccacatcaacggcgtcggc 
ggcgactgcccaggcaagaccgagatgcaccgcgatatcttgctgcgttcggatatttt 
cgtggagttcccgccacagacccggatgatccccgatcgttcaaacatttggcaataaa 
gtttcttaagattgaatcctgttgccggtcttgcgatgattatcatataatttctgttg 
aattacgttaagcatgtaataattaacatgtaatgcatgacgttatttatgagatgggt 
ttttatgattagagtcccgcaattatacatttaatacgcgatagaaaacaaaatatagc 
gcgcaaactaggataaattatcgcgcgcggtgtcatctatgttactagatcgggcctcc 
tgtcaatgctggcggcggctctggtggtggttctggtggcggctctgagggtggtggct 
ctgagggtggcggttctgagggtggcggctctgagggaggcggttccggtggtggctct 
ggttccggtgattttgattatgaaaagatggcaaacgctaataagggggctatgaccga 
aaatgccgatgaaaacgcgctacagtctgacgctaaaggcaaacttgattctgtcgcta 
ctgattacggtgctgctatcgatggtttcattggtgacgtttccggccttgctaatggt 
aatggtgctactggtgattttgctggctctaattcccaaatggctcaagtcggtgacgg 
tgataattcacctttaatgaataatttccgtcaatatttaccttccctccctcaatcgg 
ttgaatgtcgcccttttgtctttggcccaatacgcaaaccgcctctccccgcgcgttgg 
ccgattcattaatgcagctggcacgacaggtttcccgactggaaagcgggcagtgagcg 
caacgcaattaatgtgagttagctcactcattaggcaccccaggctttacactttatgc 
ttccggctcgtatgttgtgtggaattgtgagcggataacaatttcacacaggaaacagc 
tatgaccatqattacgccaagcttgcatgcctgcag ggagccatagatgcaattcaatc 
aaactgaaatttctgcaagaatctcaaacacggagatctcaaagtttgaaagaaaattt 
atttcttcgactcaaaacaaacttacgaaatttaggtagaacttatatacattatattg 
taattttttgtaacaaaatgtttttattattattatagaattttactggttaaattaaa 
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aatgaatagaaaaggtgaattaagaggagagaggaggtaaacattttcttctatttttt 
catattttcaggataaattattgtaaaagtttacaagatttccatttgactagtgtaaa 
tgaggaatattctctagtaagatcattatttcatctacttcttttatcttctaccagta 
gaggaataaacaatatttagctcctttgtaaatacaaattaattttccttcttgacatc 
attcaattttaattttacgtataaaataaaagatcatacctattagaacgattaaggag 
aaatacaattcgaatgagaaggatgtgccgtttgttataataaacagccacacgacgta 
aacgtaaaatgaccacatgatgggccaatagacatggaccgactactaataatagtaag 
ttacattttaggatggaataaatatcataccgacatcagttttgaaagaaaagggaaaa 
aaagaaaaaataaataaaagatatactaccgacatgagttccaaaaagcaaaaaaaaag 
atcaagccgacacagacacgcgtagagagcaaaatgactttgacgtcacaccacgaaaa 
cagacgcttcatacgtgtccctttatctctctcagtctctctataaacttagtgagacc 
ctcctctgttttactcacaaatatgcaaactagaaaacaatcatcaggaataaaggg tt 
tgattacttctattggaaaggactctagaggatccatggcgattcctttcatggaaacc 
gtcgttggttttatgatagtgatgtacgtttttgagacgtatttggatctgaggcaaca 
tactgctctcaagcttcccactctcccaaagactttggttggagtcattagccaagaga 
agtttgagaaatctcgagcttacagtct-tgacaaaagccattttcactttgttcatgag 
tttgttactatacttatggactctgcgattctgtt^ 

gaagatatctggcggctttctaccaatggtgggactcgatccagagaatgaaatcctgc 
acactctttcattcttggctggtcttatgacatggtcacagatcactgatttgccattt 
tctttgtactcaactttcgtgatcgagtctcggcatgggttcaacaaacaaacaatatg 
gatgttcattagggacatgatcaaaggaatactcctctctgtcatacctgcccctccta 
tcgttgccgcaattattgttatagttcagaaaggaggtccttacctcgccatctatctg 
tgggcattcatgtttatcctgtctctagtgatgatgactatataccctgttttgattgc 
acctcttttcaacaagttcactcctcttcctgatggagacctccgggagaagattgaga 
aacttgcttcttctctaaagtttcctctgaagaagctgtttgttgtcgatggatctaca 
aggtcaagccatagtaatgcttacatgtatggtttcttcaagaacaaaaggattgttct 
ttatgacacattgattcagcagtgccagaatgagaatgaaattgtggcggttattgcac 
acgagctgggacactggaagctgaatcacactacatactcgttcattgctgttcaaatc 
cttgccttcttgcaatttggaggatacactcttgtcagaaactccactgatctcttcag 
gagttttggttttgatacacaaccagttctcattggtttgatcatatttcagcacactg 
taataccacttcaacacctagtaagctttgacctcaaccttgttagtcgagcgtttgag 
tttcaggctgatgcttttgcagtgaatcttggttatgcaaaggatctacgtcctgccct 
agtgaagctacaggaagagaacttatcagcgatgaacacagacccattgtactcagctt 
atcactactcacaccctcctcttgtagagaggcttcgagccattgatggagaagacaag 
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aagacagattaacccctcgaatttccccgatcgttcaaacatttggcaataaagtttct 
taagattgaatcctgttgccggtcttgcgatgattatcatataatttctgttgaattac 
gttaagcatgtaataattaacatgtaatgcatgacgttatttatgagatgggtttttat 
gattagagtcccgcaattatacatttaatacgcgatagaaaacaaaatatagcgcgcaa 
actaggataaattatcgcgcgcggtgtcatctatgttactagatcgggaattcactggc 
cgtcgttttacaacgtcgtgactgggaaaaccctggcgttacccaacttaatcgccttg 
cagcacatccccctttcgccagctggcgtaatagcgaagaggcccgcaccgatcgccct 
tcccaacagttgcgcagcctgaatggcgcccgctcctttcgctttcttcccttcctttc 
tcgccacgttcgccggctttccccgtcaagctctaaatcgggggctccctttagggttc 
cgatttagtgctttacggcacctcgaccccaaaaaacttgatttgggtgatggttcacg 
tagtgggccatcgccctgatagacggtttttcgccctttgacgttggagtccacgttct 
ttaatagtggactcttgttccaaactggaacaacactcaaccctatctcgggctattct 
tttgatttataagggattttgccgatttcggaaccaccatcaaacaggattttcgcctg 
ctggggcaaaccagcgtggaccgcttgctgcaactctctcagggccaggcggtgaaggg 
caatcagctgttgcccgtctcactggtgaaaagaaaaaccaccccagtacattaaaaac 
gtccgcaatgtgttattaagttgtctaagcgtcaatttgtttacaccacaa tatatcct 
gcca 

SEQ ID NO:50 is the nucleic acid sequence of pRD29A-BnCPP. Italicized 
sequences are the right and left border repeats. Underlined sequence is the RD29A 
promoter. Sequence in bold is the BnCPP sense sequence. 

SEQ ID NO: 51 

gtttacccgccaa ta ta tcctgt caaacact gat agtttaaactgaaggcgggaaacga 
caatctgatcatgagcggagaattaagggagtcacgttatgacccccgccgatgacgcg 
ggacaagccgttttacgtttggaactgacagaaccgcaacgttgaaggagccactcagc 
cgcgggtttctggagtttaatgagctaagcacatacgtcagaaaccattattgcgcgtt 
caaaagtcgcctaaggtcactatcagctagcaaatatttcttgtcaaaaatgctccact 
gacgttccataaattcccctcggtatccaattagagtctcatattcactctcaatccaa 
ataatctgcaccggatctggatcgtttcgcatgattgaacaagatggattgcacgcagg 
ttctccggccgcttgggtggagaggctattcggctatgactgggcacaacagacaatcg 
gctgctctgatgccgccgtgttccggctgtcagcgcaggggcgcccggttctttttgtc 
aagaccgacctgtccggtgccctgaatgaactgcaggacgaggcagcgcggctatcgtg 
gctggccacgacgggcgttccttgcgcagctgtgctcgacgttgtcactgaagcgggaa 
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gggactggctgctattgggcgaagtgccggggcaggatctcctgtcatctcaccttgct 
cctgccgagaaagtatccatcatggctgatgcaatgcggcggctgcatacgcttgatcc 
ggctacctgcccattcgaccaccaagcgaaacatcgcatcgagcgagcacgtactcgga 
tggaagccggtcttgtcgatcaggatgatctggacgaagagcatcaggggctcgcgcca 
5 gccgaactgttcgccaggctcaaggcgcgcatgcccgacggcgatgatctcgtcgtgac 
ccatggcgatgcctgcttgccgaatatcatggtggaaaatggccgcttttctggattca 
tcgactgtggccggctgggtgtggcggaccgctatcaggacatagcgttggctacccgt 
gatattgctgaagagcttggcggcgaatgggctgaccgcttcctcgtgctttacggtat 
cgccgctcccgattcgcagcgcatcgccttctatcgccttcttgacgagttcttctgag 

10 cgggactctggggttcgaaatgaccgaccaagcgacgcccaacctgccatcacgagatt 
tcgattccaccgccgccttctatgaaaggttgggcttcggaatcgttttccgggacgcc 
ggctggatgatcctccagcgcggggatctcatgctggagttcttcgcccacgggatctc 
tgcggaacaggcggtcgaaggtgccgatatcattacgacagcaacggccgacaagcaca 
acgccacgatcctgagcgacaatatgatcgggcccggcgtccacatcaacggcgtcggc 

15 ggcgactgcccaggcaagaccgagatgcaccgcgatatcttgctgcgttcggatatttt 
cgtggagttcccgccacagacccggatgatccccgatcgttcaaacatttggcaataaa 
gtttcttaagattgaatcctgttgccggtcttgcgatgattatcatataatttctgttg 
aattacgttaagcatgtaataattaacatgtaatgcatgacgt tat t tat gaga tgggt 
ttttatgattagagtcccgcaattatacatttaatacgcgatagaaaacaaaatatagc 

20 gcgcaaactaggataaattatcgcgcgcggtgtcatctatgttactagatcgggcctcc 
tgtcaatgctggcggcggctctggtggtggttctggtggcggctctgagggtggtggct 
ctgagggtggcggttctgagggtggcggctctgagggaggcggttccggtggtggctct 
ggttccggtgattttgattatgaaaagatggcaaacgctaataagggggctatgaccga 
aaatgccgatgaaaacgcgctacagtctgacgctaaaggcaaacttgattctgtcgcta 

25 ctgattacggtgctgctatcgatggtttcattggtgacgtttccggccttgctaatggt 
aatggtgctactggtgattttgctggctctaattcccaaatggctcaagtcggtgacgg 
tgataattcacctttaatgaataatttccgtcaatatttaccttccctccctcaatcgg 
ttgaatgtcgcccttttgtctttggcccaatacgcaaaccgcctctccccgcgcgttgg 
ccgattcattaatgcagctggcacgacaggtttcccgactggaaagcgggcagtgagcg 

30 caacgcaattaatgtgagttagctcactcattaggcaccccaggctttacactttatgc 
ttccggctcgtatgttgtgtggaattgtgagcggataacaatttcacacaggaaacagc 
tatqaccatqattacgccaagcttqcatqcctgcag ggagccatagatgcaattcaatc 
aaactgaaatttctgcaagaatctcaaacacqqagatctcaaagtttgaaagaaaattt 
atttcttcqactcaaaacaaacttacqaaatttagqtaqaacttatatacattatattg 
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taattttttgtaacaaaatgtttttattattattataqaattttactggttaaattaaa 
aatgaatagaaaaggtgaattaagaggagagaqqaggtaaacattttcttctatttttt 
catattttcaggataaattattgtaaaagtttacaagatttccatttgactagtgtaaa 
tgaggaatattctctagtaagatcattatttcatctacttcttttatcttctaccagta 
5 gaggaataaacaatatttagctcctttgtaaatacaaattaattttccttcttgacatc 
attcaattttaattttacgtataaaataaaagatcatacctattagaacgattaaggag 
aaatacaattcgaatgagaaggatgtgccgtttgttataataaacagccacacgacgta 
aacgtaaaatgaccacatgatgggccaatagacatggaccgactactaataatagtaag 
ttacattttaggatggaataaatatcataccgacatcagttttgaaagaaaagggaaaa 

10 aaagaaaaaataaataaaagatatactaccgacatgagttccaaaaagcaaaaaaaaag 
atcaagccgacacagacacgcgtagagagcaaaatgactttgacgtcacaccacgaaaa 
cagacgcttcatacgtgtccctttatctctctcagtctctctataaacttagtgagacc 
ctcctctgttttactcacaaatatgcaaactagaaaacaatcatcaggaataaagggtt 
tgattacttctattggaaag gactctagaccagtg-bcccagc^cgt:gi:gcaa-baaccgc 

15 cacaatttcattctcattctggcactgctgaatcaatgtgtcataaagaacaatccttt 
tgttcttgaagaaaccatacatgtaagcattactatggcttgaccttgtagatccatcg 
acaacaaacagcttcttcagaggaaactttagagaagaagcaagtttctcaatcttctc 
ccggaggtctccatcaggaagaggagtgaacttgttgaaaagaggtgcaatcaaaacag 
ggtatatagtcatcatcactagagacaggataaacatgaatgcccacagatagatggcg 

20 aggtaaggacctcctttctgaactataacaataattgcggcaacgataggaggggcagg 
tatgacagagaggagtattcctttgatca-tgtccctaatgaacatccatattgtttgtt 
tgttgaacccatgccgagactcgatcacgaaagttgagtacaaagaaaatggcaaatca 
gtgatctgtgaccatgtcataagaccagccaagaatgaaagagtgtgcaggatttcatt 
ctctggatcgagtcccaccattggtagaaggatccccATCTACCCGCTTCGCGTCGGCA 

25 TCCGGTCAGTGGCAGTGAAGGGCGAACAGTTCCTGATTAACCACAAACCGTTCTACTTT 
ACTGGCTTTGGTCGTCATGAAGATGCGGACTTGCGTGGCAAAGGATTCGATAACGTGCT 
GATGGTGCACGACCACGCATTAATGGACTGGATTGGGGCCAACTCCTACCGTACCTCGC 
ATTACCCTTACGCTGAAGAGATGCTCGACTGGGCAGATGAACATGGCATCGTGGTGATT 
GATGAAACTGCTGCTGTCGGCTTTTCGCTCTCTTTAGGCATTGGTTTCGAAGCGGGCAA 

30 CAAGCCGAAAGAACTGTACAGCGAAGAGGCAGTCAACGGGGAAACTCAGCAAGCGCACT 
TACAGGCGATTAAAGAGCTGATAGCGCGTGACAAAAACCACCCAAGCGTGGTGATGTGG 
AGTATTGCCAACGAACCGGATACCCGTCCGCAAGGTGCACGGGAATATTTCGCGCCACT 
GGCGGAAGCAACGCGTAAACTCGACCCGACGCGTCCGATCACCTGCGTCAATGTAATGT 
TCTGCGACGCTCACACCGATACCATCAGCGATCTCTTTGATGTGCTGTGCCTGAACCGT 
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TATTACGGATGGTATGTCCAAAGCGGCGATTTGGAAACGGCAGAGAAGGTACTGGAAAA 
AGAACTTCTGGCCTGGCAGGAGAAACTGTACACCGACATGTGGAGTGAAGAGTATCAGT 
GTGCATGGCTGGATATGTATCACCGCGTCTTTGATCGCGTCAGCGCCGTCGTCGGTGAA 
CAGGTATGGAATTTCGCCGATTTTGCGACCTCGCAAGGCATATTGCGCGTTGGCGGTAA 
CAAGAAAGGGATCTTCACTCGCGACCGCAAACCGAAGTCGGCGGCTTTTCTGCTGCAAA 
AACGCTGGACTGGCATGAACTTCGGTGAAAAACCGCAGCAGGGAGGCAAACAATGAATC 
AACAACTCTCCTGGCGCACCATCGTCGGCTACAGCCTCGGGAATTGCTACCGAGCTCtt 
ctaccaatggtgggactcgatccagagaatgaaatcctgcacactctttcattcttggc 
tggtct^atgacatggtcacagatcactgatt^gccattt'bc'bttg'bac-bcaacti'b-bcg 
tga'bcgagtc-bcggcaiigggttcaacaaacaaacaa-ba'bgga-bgt'bcattagggaca'bg 
atcaaaggaatactcctctctgtcatacctgcccctcctatcgttgccgcaattattgt 
tatagttcagaaaggagg'tccttiacctcgcca-bc-ba-bctigtgggca-btca'bgtttatcc 
tgtctct:agtgatga'bgact:atataccctgtt < b1:gat: < bgcacctc-b < btt:caacaag'b-bc 
actcctcttcctgatggagacctccgggagaagattgagaaacttgcttcttctctaaa 
gtttcctcfcgaagaagctgtttgttgtcgatggatctacaaggtcaagccatagtaatg 
cttacatgtafcggtttct'bcaagaacaaaagga'b'tgt'bct'b'ba'bgacaca-b'bga'b'bcag 
cag-bgccagaatgagaatgaaa-btgtggcgg-btat tgcacacgagctgggacactggga 
gctcgaatttccccgatcgttcaaacatttggcaataaagtttcttaagattgaatcct 
gttgccggtcttgcgatgattatcatataatttctgttgaattacgttaagcatgtaat 
aattaacatgtaatgcatgacgttatttatgagatgggtttttatgattagagtcccgc 
aattatacatttaatacgcgatagaaaacaaaatatagcgcgcaaactaggataaatta 
tcgcgcgcggtgtcatctatgttactagatcgggaattcactggccgtcgttttacaac 
gtcgtgactgggaaaaccctggcgttacccaacttaatcgccttgcagcacatccccct 
ttcgccagctggcgtaatagcgaagaggcccgcaccgatcgcccttcccaacagttgcg 
cagcctgaatggcgcccgctcctttcgctttcttcccttcctttctcgccacgttcgcc 
ggctttccccgtcaagctctaaatcgggggctccctttagggttccgatttagtgcttt 
acggcacctcgaccccaaaaaacttgatttgggtgatggttcacgtagtgggccatcgc 
cctgatagacggtttttcgccctttgacgttggagtccacgttctttaatagtggactc 
ttgttccaaactggaacaacactcaaccctatctcgggctattcttttgatttataagg 
gattttgccgatttcggaaccaccatcaaacaggattttcgcctgctggggcaaaccag 
cgtggaccgcttgctgcaactctctcagggccaggcggtgaagggcaatcagctgttgc 
ccgtctcactggtgaaaagaaaaaccaccccagtacattaaaaacgtccgcaatgtgtt 
at t aagt t gt ct aagcgtcaat ttgttt a caccaca a ta ta tcctgcca 
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SEQ ID NO:51 is the nucleic acid sequence of pRD29A-HP-BnCPP. Italicized 
sequences are the right and left border repeats. Underlined sequence is the RD29A 
promoter. Sequence in bold is the BnCPP antisense sequence, bold and underlined 
sequence is BnCPP sense fragment and the upper case sequence represents the truncated 
5 GUS fragment. 



SEQ ID NO:52 

gtttacccgccaa ta ta tcctgt caaacact gat agtttaaactgaaggcgggaaacga 
caatctgatcatgagcggagaattaagggagtcacgttatgacccccgccgatgacgcg 

10 ggacaagccgttttacgtttggaactgacagaaccgcaacgttgaaggagccactcagc 
cgcgggtttctggagtttaatgagctaagcacatacgtcagaaaccattattgcgcgtt 
caaaagtcgcctaaggtcactatcagctagcaaatatttcttgtcaaaaatgctccact 
gacgttccataaattcccctcggtatccaattagagtctcatattcactctcaatccaa 
ataatctgcaccggatctggatcgtttcgcatgattgaacaagatggattgcacgcagg 

15 ttctccggccgcttgggtggagaggctattcggctatgactgggcacaacagacaatcg 
gctgctctgatgccgccgtgttccggctgtcagcgcaggggcgcccggttctttttgtc 
aagaccgacctgtccggtgccctgaatgaactgcaggacgaggcagcgcggctatcgtg 
gctggccacgacgggcgttccttgcgcagctgtgctcgacgttgtcactgaagcgggaa 
gggactggctgctattgggcgaagtgccggggcaggatctcctgtcatctcaccttgct 

20 cctgccgagaaagtatccatcatggctgatgcaatgcggcggctgcatacgcttgatcc 
ggctacctgcccattcgaccaccaagcgaaacatcgcatcgagcgagcacgtactcgga 
tggaagccggtcttgtcgatcaggatgatctggacgaagagcatcaggggctcgcgcca 
gccgaactgttcgccaggctcaaggcgcgcatgcccgacggcgatgatctcgtcgtgac 
ccatggcgatgcctgcttgccgaatatcatggtggaaaatggccgcttttctggattca 

25 tcgactgtggccggctgggtgtggcggaccgctatcaggacatagcgttggctacccgt 
gatattgctgaagagcttggcggcgaatgggctgaccgcttcctcgtgctttacggtat 
cgccgctcccgattcgcagcgcatcgccttctatcgccttcttgacgagttcttctgag 
cgggactctggggttcgaaatgaccgaccaagcgacgcccaacctgccatcacgagatt 
tcgattccaccgccgccttctatgaaaggttgggcttcggaatcgttttccgggacgcc 

30 ggctggatgatcctccagcgcggggatctcatgctggagttcttcgcccacgggatctc 
tgcggaacaggcggtcgaaggtgccgatatcattacgacagcaacggccgacaagcaca 
acgccacgatcctgagcgacaatatgatcgggcccggcgtccacatcaacggcgtcggc 
ggcgactgcccaggcaagaccgagatgcaccgcgatatcttgctgcgttcggatatttt 
cgtggagttcccgccacagacccggatgatccccgatcgttcaaacatttggcaataaa 
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gtttcttaagattgaatcctgttgccggtcttgcgatgattatcatataatttctgttg 
aattacgttaagcatgtaataattaacatgtaatgcatgacgttatttatgagatgggt 
ttttatgattagagtcccgcaattatacatttaatacgcgatagaaaacaaaatatagc 
gcgcaaactaggataaattatcgcgcgcggtgtcatctatgttactagatcgggcctcc 
tgtcaatgctggcggcggctctggtggtggttctggtggcggctctgagggtggtggct 
ctgagggtggcggttctgagggtggcggctctgagggaggcggttccggtggtggctct 
ggttccggtgattttgattatgaaaagatggcaaacgctaataagggggctatgaccga 
aaatgccgatgaaaacgcgctacagtctgacgctaaaggcaaacttgattctgtcgcta 
ctgattacggtgctgctatcgatggtttcattggtgacgtttccggccttgctaatggt 
aatggtgctactggtgattttgctggctctaattcccaaatggctcaagtcggtgacgg 
tgataattcacctttaatgaataatttccgtcaatatttaccttccctccctcaatcgg 
ttgaatgtcgcccttttgtctttggcccaatacgcaaaccgcctctccccgcgcgttgg 
ccgattcattaatgcagctggcacgacaggtttcccgactggaaagcgggcagtgagcg 
caacgcaattaatgtgagttagctcactcattaggcaccccaggctttacactttatgc 
ttccggctcgtatgttgtgtggaattgtgagcggataacaatttcacacaggaaacagc 
tatqaccatqattacqccaagcttgcatgcctgcag ggagccatagatgcaattcaatc 
aaactgaaatttctgcaagaatctcaaacacggagatctcaaagtttgaaagaaaattt 
atttcttcgactcaaaacaaacttacgaaatttaggtagaacttatatacattatattg 
taattttttgtaacaaaatgtttttattattattatagaattttactggttaaattaaa 
aatgaatagaaaaggtgaattaagaggagagaggaggtaaacattttcttctatttttt 
catattttcaggataaattattgtaaaagtttacaagatttccatttgactagtgtaaa 
tgaggaatattctctagtaagatcattatttcatctacttcttttatcttctaccagta 
gaggaataaacaatatttagctcctttgtaaatacaaattaattttccttcttgacatc 
attcaattttaattttacgtataaaataaaagatcatacctattagaacgattaaggag 
aaatacaattcgaatgagaaggatgtgccgtttgttataataaacagccacacgacgta 
aacgtaaaatgaccacatgatgggccaatagacatggaccgactactaataatagtaag 
ttacattttaggatggaataaatatcataccgacatcagttttgaaagaaaagggaaaa 
aaagaaaaaataaataaaagatatactaccgacatgagttccaaaaagcaaaaaaaaag 
atcaagccgacacagacacgcgtagagagcaaaatgactttgacgtcacaccacgaaaa 
cagacgcttcatacgtgtccctttatctctctcagtctctctataaacttagtgagacc 
ctcctctgttttactcacaaatatgcaaactagaaaacaatcatcaggaataaaggg tt 
tgattacttctattggaaaggactctagaggatccttaatctgtcttcttgtcttctcc 
atcaatggctcgaagcctctctacaagaggagggtgtgagtagtgataagctgagtaca 
atgggtctgtgttcatcgctgataagttctcttcctgtagcttcactagggcaggacgt 
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agatcctttgcataaccaagattcactgcaaaagcatcagcctgaaactcaaacgctcg 
actaacaaggttgaggtcaaagcttactaggtgttgaagtggtattacagtgtgctgaa 
atatgatcaaaccaatgagaactggttgtgtatcaaaaccaaaactcctgaagagatca 
gtggagtttctgacaagagtgtatcctccaaattgcaagaaggcaaggatttgaacagc 
aatgaacgagtatgtagtgtgattcagcttccagtgtcccagctcgtgtgcaataaccg 
ccacaatttcattctcattctggcactgctgaatcaatgtgtcataaagaacaatcctt 
ttgttcttgaagaaaccatacatgtaagcattactatggcttgaccttgtagatccatc 
gacaacaaacagcttcttcagaggaaactttagagaagaagcaagtttctcaatcttct 
cccggaggtctccatcaggaagaggagtgaacttgttgaaaagaggtgcaatcaaaaca 
gggtatatagtcatcatcactagagacaggataaacatgaatgcccacagatagatggc 
gaggtaaggacctcctttctgaactataacaataattgcggcaacgataggaggggcag 
gtatgacagagaggagtattcctttgatcatgtccctaatgaacatccatattgtttgt 
ttgttgaacccatgccgagactcgatcacgaaagttgagtacaaagaaaatggcaaatc 
agtgatctgtgaccatgtcataagaccagccaagaatgaaagagtgtgcaggatttcat 
tctctggatcgagtcccaccattggtagaaagccgccagatatcttccaaaaccaaggc 
aagatcccaaagaacagaatcgcagagtccataagtatagtaacaaactcatgaacaaa 
gtgaaaatggcttttgtcaagactgtaagctcgagatttctcaaacttctcttggctaa 
tgactccaaccaaagtctttgggagagtgggaagcttgagagcagtatgttgcctcaga 
tccaaatacgtctcaaaaacgtacatcactatcataaaaccaacgacggtttccatgaa 
aggaatcgccatcccctcgaatttccccgatcgttcaaacatttggcaataaagtttct 
taagattgaatcctgttgccggtcttgcgatgattatcatataatttctgttgaattac 
gttaagcatgtaataattaacatgtaatgcatgacgttatttatgagatgggtttttat 
gattagagtcccgcaattatacatttaatacgcgatagaaaacaaaatatagcgcgcaa 
actaggataaattatcgcgcgcggtgtcatctatgttactagatcgggaattcactggc 
cgtcgttttacaacgtcgtgactgggaaaaccctggcgttacccaacttaatcgccttg 
cagcacatccccctttcgccagctggcgtaatagcgaagaggcccgcaccgatcgccct 
tcccaacagttgcgcagcctgaatggcgcccgctcctttcgctttcttcccttcctttc 
tcgccacgttcgccggctttccccgtcaagctctaaatcgggggctccctttagggttc 
cgatttagtgctttacggcacctcgaccccaaaaaacttgatttgggtgatggttcacg 
tagtgggccatcgccctgatagacggtttttcgccctttgacgttggagtccacgttct 
ttaatagtggactcttgttccaaactggaacaacactcaaccctatctcgggctattct 
tttgatttataagggattttgccgatttcggaaccaccatcaaacaggattttcgcctg 
ctggggcaaaccagcgtggaccgcttgctgcaactctctcagggccaggcggtgaaggg 
caatcagctgttgcccgtctcactggtgaaaagaaaaaccaccccagtacattaaaaac 
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gtccgcaatgtgttattaagttgtctaagcgtcaatttgtttacaccacaa ta ta tec t 
geca 



SEQ ID NO: 52 is the nucleic acid sequence of pRD29A-antisense-BnCPP. 
Italicized sequences are the right and left border repeats. Underlined sequence is the 
RD29A promoter. Sequence in bold is the BnCPP antisense sequence. 

SEQ ID NO:53 

gtttacccgccaatata tcctgtcaaacactgatagtttaaactgaaggcgggaaacga 
caatctgatcatgagcggagaattaagggagtcacgttatgacccccgccgatgacgcg 
ggacaagccgttttacgtttggaactgacagaaccgcaacgttgaaggagccactcagc 
cgcgggtttctggagtttaatgagctaagcacatacgtcagaaaccattattgcgcgtt 
caaaagtcgcctaaggtcactatcagctagcaaatatttcttgtcaaaaatgctccact 
gacgttccataaattcccctcggtatccaattagagtctcatattcactctcaatccaa 
ataatctgcaccggatctggatcgtttcgcatgattgaacaagatggattgcacgcagg 
ttctccggccgcttgggtggagaggctattcggctatgactgggcacaacagacaatcg 
gctgctctgatgccgccgtgttccggctgtcagcgcaggggcgcccggttctttttgtc 
aagaccgacctgtccggtgccctgaatgaactgcaggacgaggcagcgcggctatcgtg 
gctggccacgacgggcgttccttgcgcagctgtgctcgacgttgtcactgaagcgggaa 
gggactggctgctattgggcgaagtgccggggcaggatctcctgtcatctcaccttgct 
cctgccgagaaagtatccatcatggctgatgcaatgcggcggctgcatacgcttgatcc 
ggctacctgcccattcgaccaccaagcgaaacatcgcatcgagcgagcacgtactcgga 
tggaagccggtcttgtcgatcaggatgatctggacgaagagcatcaggggctcgcgcca 
gccgaactgttcgccaggctcaaggcgcgcatgcccgacggcgatgatctcgtcgtgac 
ccatggcgatgcctgcttgccgaatatcatggtggaaaatggccgcttttctggattca 
tcgactgtggccggctgggtgtggcggaccgctatcaggacatagcgttggctacccgt 
gatattgctgaagagcttggcggcgaatgggctgaccgcttcctcgtgctttacggtat 
cgccgctcccgattcgcagcgcatcgccttctatcgccttcttgacgagttcttctgag 
cgggactctggggttcgaaatgaccgaccaagcgacgcccaacctgccatcacgagatt 
tcgattccaccgccgccttctatgaaaggttgggcttcggaatcgttttccgggacgcc 
ggctggatgatcctccagcgcggggatctcatgctggagttcttcgcccacgggatctc 
tgcggaacaggcggtcgaaggtgccgatatcattacgacagcaacggccgacaagcaca 
acgccacgatcctgagcgacaatatgatcgggcccggcgtccacatcaacggcgtcggc 
ggcgactgcccaggcaagaccgagatgcaccgcgatatcttgctgcgttcggatatttt 
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cgtggagttcccgccacagacccggatgatccccgatcgttcaaacatttggcaataaa 
gtttcttaagattgaatcctgttgccggtcttgcgatgattatcatataatttctgttg 
aattacgttaagcatgtaataattaacatgtaatgcatgacgttatttatgagatgggt 
ttttatgattagagtcccgcaattatacatttaatacgcgatagaaaacaaaatatagc 
gcgcaaactaggataaattatcgcgcgcggtgtcatctatgttactagatcgggcctcc 
tgtcaatgctggcggcggctctggtggtggttctggtggcggctctgagggtggtggct 
ctgagggtggcggttctgagggtggcggctctgagggaggcggttccggtggtggctct 
ggttccggtgattttgattatgaaaagatggcaaacgctaataagggggctatgaccga 
aaatgccgatgaaaacgcgctacagtctgacgctaaaggcaaacttgattctgtcgcta 
ctgattacggtgctgctatcgatggtttcattggtgacgtttccggccttgctaatggt 
aatggtgctactggtgattttgctggctctaattcccaaatggctcaagtcggtgacgg 
tgataattcacctttaatgaataatttccgtcaatatttaccttccctccctcaatcgg 
ttgaatgtcgcccttttgtctttggcccaatacgcaaaccgcctctccccgcgcgttgg 
ccgattcattaatgcagctggcacgacaggtttcccgactggaaagcgggcagtgagcg 
caacgcaattaatgtgagttagctcactcattaggcaccccaggctttacactttatgc 
ttccggctcgtatgttgtgtggaattgtgagcggataacaatttcacacaggaaacagc 
tatgaccatgattacgccaagct gggaaatttttcgccagttctaaatatccggaaacc 
tcttgggatgccattgcccatctatctgtaatttattgacgaaatagacgaaaaggaag 
gtggctcctataaagcacatcattgcgataacagaaaggccattgttgaagatacctct 
gctgacattggtccccaagtggaagcaccaccccatgaggagcaccgtggagtaagaag 
acgttcgagccacgtcgaaaaagcaagtgtgttgatgtagtatctccattgacgtaagg 
gatgacgcacaatccaactatccatcgcaagaccattgctctatata agaaagttaata 
tcatttcgagtggccacgctgagggggatccatggcgattcctttcatggaaaccgtcg 
ttggttttatgatagtgatgtacgtttttgagacgtatttggatctgaggcaacatact 
gctctcaagcttcccactctcccaaagactttggttggagtcattagccaagagaagtt 
tgagaaatctcgagcttacagtcttgacaaaagccattttcactttgttcatgagtttg 
ttactatacttatggactctgcgattctgttctttgggatcttgccttggttttggaag 
atatctggcggctttctaccaatggtgggactcgatccagagaatgaaatcctgcacac 
tctttcattcttggctggtcttatgacatggtcacagatcactgatttgccattttctt 
tgtactcaactttcgtgatcgagtctcggcatgggttcaacaaacaaacaatatggatg 
ttcattagggacatgatcaaaggaatactcctctctgtcatacctgcccctcctatcgt 
tgccgcaattattgttatagttcagaaaggaggtccttacctcgccatctatctgtggg 
cattcatgtttatcctgtctctagtgatga-tgactatataccctgttttgattgcacct 
cttttcaacaagttcactcctcttcctgatggagacctccgggagaagattgagaaact 
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tgcttcttctctaaagtttcctctgaagaagctgtttgttgtcgatggatctacaaggt 
caagccatagtaatgcttacatgtatggtttcttcaagaacaaaaggattgttctttat 
gacacattgattcagcagtgccagaatgagaatgaaattgtggcggttattgcacacga 
gctgggacactggaagctgaatcacactacatactcgttcattgctgttcaaatccttg 
ccttcttgcaatttggaggatacactcttgtcagaaactccactgatctcttcaggagt 
tttggttttgatacacaaccagttctcattggtttgatcatatttcagcacactgtaat 
accacttcaacacctagtaagctttgacctcaaccttgttagtcgagcgtttgagtttc 
aggctgatgcttttgcagtgaatcttggttatgcaaaggatctacgtcctgccctagtg 
aagctacaggaagagaacttatcagcgatgaacacagacccattgtactcagcttatca 
ctactcacaccctcctcttgtagagaggcttcgagccattgatggagaagacaagaaga 
cagattaacccctcgaatttccccgatcgttcaaacatttggcaataaagtttcttaag 
attgaatcctgttgccggtcttgcgatgattatcatataatttctgttgaattacgtta 
agcatgtaataattaacatgtaatgcatgacgttatttatgagatgggtttttatgatt 
agagtcccgcaattatacatttaatacgcgatagaaaacaaaatatagcgcgcaaacta 
ggataaattatcgcgcgcggtgtcatctatgttactagatcgggaattcactggccgtc 
gttttacaacgtcgtgactgggaaaaccctggcgttacccaacttaatcgccttgcagc 
acatccccctttcgccagctggcgtaatagcgaagaggcccgcaccgatcgcccttccc 
aacagttgcgcagcctgaatggcgcccgctcctttcgctttcttcccttcctttctcgc 
cacgttcgccggctttccccgtcaagctctaaatcgggggctccctttagggttccgat 
ttagtgctttacggcacctcgaccccaaaaaacttgatttgggtgatggttcacgtagt 
gggccatcgccctgatagacggtttttcgccctttgacgttggagtccacgttctttaa 
tagtggactcttgttccaaactggaacaacactcaaccctatctcgggctattcttttg 
atttataagggattttgccgatttcggaaccaccatcaaacaggattttcgcctgctgg 
ggcaaaccagcgtggaccgcttgctgcaactctctcagggccaggcggtgaagggcaat 
cagctgttgcccgtctcactggtgaaaagaaaaaccaccccagtacattaaaaacgtcc 
gcaat gt gt t at taagttgtctaagcgtcaatttgt t ta caeca caatatatcctgcca 

SEQ ID NO:53 is the nucleic acid sequence of MuA-BnCPP. Italicized sequences 
are the right and left border repeats. Underlined sequence is the MuA promoter. 
Sequence in bold is the BnCPP sense sequence. 

Example 5. Southern Analysis 

Genomic Southern blot analysis of transgenic Arabidopsis was performed using 
standard techniques known to one skilled in the art. Typically, 10p,g of DNA was 
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electrophoresed in a 0.8% agarose gel and transferred to an appropriate membrane such 
as Hybond N+ (Amersham Pharmacia Biotech). Pre-hybridization and hybridization 
conditions were as suggested by the membrane manufacturer, typically at 65°C. The 
final stringency wash was typically at 1XSSC and 0.1% SDS at 65°C. The NPTII coding 
5 region was typically used as the radiolabeled probe in Southern blot analysis. 

Thirty-seven Arabidopsis lines were selected as homozygous pBI121-AtCPP 
over-expression lines for further examination. Figure 3 shows a representative blot 
confirming the presence of the pBI121-AtCPP transgene. Lines were confirmed to be 
transgenic by PCR analysis using transgene specific primers in the PCR assays. 

10 Thirty-three Arabidopsis lines were selected as homozygous pBI121-HP-AtCPP 

hair-pin down-regulation lines for further examination. Figure 4 shows a representative 
blot confirming the presence of the pBI121-HP-AtCPP hair-pin construct. All lines were 
confirmed to be transgenic by PCR analysis using transgene specific primers in the PCR 
assays. 

15 Arabidopsis lines were selected as homozygous pRD29A-AtCPP over-expression 

lines for further examination. Figure 5 shows a representative blot confirming the 
presence of the pRD29A-AtCPP transgene. Lines were confirmed to be transgenic by 
PCR analysis using transgene specific primers in the PCR assays. 

Arabidopsis lines were selected as homozygous pRD29A-HP-AtCPP lines for 
20 further examination. Figure 6 shows a representative blot confirming the presence of the 
pRD29A-HP-AtCPP transgene. Lines were confirmed to be transgenic by PCR analysis 
using transgene specific primers in the PCR assays. 



Example 6: PCR analysis of transgenic plants 

PCR was used as a method to confirm the presence of the transgene in all 
25 transgenic lines and every construct.. Typical PCR mixtures contained: IX reaction 
buffer (lOmM Tris-HCl pH 8.8, 1.5mM MgCl 2> , 50mM KC1), dNTPs at 200|liM, lpM 
forward and reverse primer, 2.5U. Taq DNA polymerase, and template plus water to a 
final volume of 50jaL. Reactions were run at 1 minute 94°C, 1 minute 60°C, 1 minute 
72°C, for 30 cycles. Primers used in the analysis of pBI121-AtCPP and pBI121-HP- 
30 AtCPP transgenic plants were as shown in Table 8. Primers used in the analysis of 

pRD29A-AtCPP were RD29AP1 (SEQ ID NO:66 ) and SEQ ID NO:7. Primers used in 
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the analysis of pRD29A-HP-AtCPP transgenic plants were those identified as RD29AP1 
(SEQ ID NO:66), SEQ ID NO:8 and SEQ ID NO:8, Nosterm-RV (SEQ ID NO:67 ). 

Table 8. 

5 pBI121-AtCPPBamFW: 5 '-GCCGAC AGTGGTCCC AAAG ATGG-3 ' 
(SEQ ID NO: 10) 

p35S-AtCPP SmaRV: 5'-AAACCCGGGTTAATCTGTCTTCTTGTCTTCTCCA-3' 
(SEQ ID NO:7) 

p35S-HP-AtCPP BamFW: 5'-CTGGAGCTCTTTTACCGAGGTTGGGCCTTGATCC-3' 
10 (SEQ ID NO: 8) 

p35S-HP-AtCPP SmaRV: 5-GCAAGACCGGC AAC AGGA-3 1 
(SEQ ID NO: 13) 

pRD29AP 1 : 5'-TTTAAGCTTGGAGCC ATAGATGC AATTC AA -3' 

(SEQ ID NO:66) 

15 pRD29APl: 5 '-TTTAAGCTTGGAGCC ATAGATGC AATTC AA -3 ' 

(SEQ ID NO:66) 

Nosterm-RV: 5'-GC AAGACCGGC AAC AGGA-3' 

(SEQ ID NO:67) 

Example 7: Northern analysis of transgenic plants 

20 Total RNA was isolated from developing leaf tissue of 27 35S-AtCPP 

Arabidopsis lines (T3 plants). Approximately 10 ug of total RNA was loaded into each 
lane. The Northern blot was first probed with P 32 labeled, single-stranded antisense 
transcript of AtCPP which detects sense transcript, then stripped and re-probed with 
cDNA of P-tubulin that was used as a reference. The hybridizing bands of AtCPP and P- 

25 tubulin were scanned and quantified using the UN-Scan-It programme (Silk Scientific, 
Utah, USA), and the ratio of the two hybridizing bands for each sample was obtained. 
The ratio of the wild type plants was set to 100%, and was compared with those of the 
transgenic lines. Twenty-one out of twenty-seven lines showed higher expression of 
AtCPP transcript as compared to the wild type. Values ranged from 104 % to 282 % of 
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wild type. The results of five lines (35, 84, 76, 136, and 156) of the 21 over-expressing 
lines is shown in Figure 7. 



Example 8: Production of polyclonal antibodies against AtCPP 

Anti-AtCPP antibodies were generated using AtCPP fusion protein over-expressed 
5 in E. coli. The over-expression vector, pMAL-p2, contains 1 1 75 bp malE gene that is 
located upstream of AtCPP and encodes a 43 KDa maltose-binding protein (MBP). The 
1275 bp BamUl/Smal DNA fragment of AtCPP was inserted into pMAL-p2 at BamUl 
and Sail sites. The Sail site was converted into blunt end using Klenow fragment. The 
resulting fusion protein MBP-AtCPP was then over-expressed in DH5a, and purified by 

10 one-step affinity for MBP as described by the manufacturer (New England Biolab). The 
soluble fraction of the crude bacterial extract containing the MBP-AtCPP fusion protein 
was loaded to a amylose column (1.5 cm x 10.0 cm), and the proteins were eluted with 10 
mM maltose in column buffer (50 mM Tris-HCl, pH 7.5, 1 mM EDTA, and 200 mM 
NaCl). Fractions containing purified MBP-AtCPP fusion protein were pooled, and 

15 concentrated with a Centriprep-30 concentrator (Amicon). All purification steps were 
carried out at 4°C. To generate an antibody, the purified fusion protein was further 
separated by SDS-PAGE and the Coomassie stained band corresponding to the fusion 
protein was excised. The identity of the fusion protein was confirmed by Western 
analysis using anti-MBP antibodies (purchased from New England Biolab). The protein 

20 was eluted from the gel slice by electroelution and then emulsified in Ribi adjuvant (Ribi 
Immunochem) to a final volume of 1 ml. MBP-AtCPP protein was injected into a 3 kg 
New Zealand rabbit on day 1 and booster injections were given on day 21 and day 35 with 
175 |ig of the protein each time. High-titer antisera were obtained one week after the final 
injection. 

25 Example 9: Western blot analysis of 35S-AtCPP transgenic lines using Anti-AtCPP 
antibodies. 

Western analysis was performed to examine expression level of AtCPP in the 

transgenic lines compared with that of wild type plants. Anti-Bip antibody, an ER 

lumenal protein (Stressgen, Victoria, BC, Canada ) was used as a reference. Total 

30 proteins were extracted from developing leaf tissue of five ABA S lines and a wild type 

control.. The antigenic protein bands of AtCPP and Bip were scanned and quantified 

using the UN-Scan-It programme (Silk Scientific, Utah, USA) and the ratio of the two 
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protein bands for each sample was obtained. The ratio of the wild type plants was set to 
100%, and was compared with those of the transgenic lines. Data is presented in Figure 
7 indicating that the AtCPP protein level was increased in the transgenic lines compared 
to the wild type plants. 

5 Example 10: ABA sensitivity of transgenic seedlings. 

Approximately 100 seeds were assessed per line per 9 cm plate. Seeds were 
plated on minimal medium (1/2 MS) supplemented with no ABA or 1 .0 |iM ABA. Plates 
were chilled for 3 days at 4 °C in the dark, and incubated for up to 2 1 days at 22 °C with 
24 hour continuous light. Plates were assessed for germination, cotyledon expansion, 

10 true leaf development and seedling vigor. Seedlings were assessed for ABA sensitivity 
over 21 days of growth at which time sensitive seedlings were arrested at the cotyledon 
stage, lacked true leaves, and showed inhibition of root growth. Wild type control 
Columbia plants had two to three pairs of true leaves and a well developed root system. 
Lines were categorized as ABA sensitive (ABA S ) if less than 1% of plants looked like 

15 control, moderately ABA sensitive (ABA MS ) if more than 1% but less than 50% of 

looked like control, or ABA insensitive (ABA Wt ) if greater than 50% looked like control. 

For example, if a plate had 20 healthy seedlings and the control plate had 60 
healthy seedlings, the line would be 33% of control and categorized as moderately ABA 
sensitive. 

20 All four vector constructs (pBI 1 2 1 - AtCPP, pBI 1 2 1 Hp- AtCPP, pRD29 AHp- 

AtCPP, pRD29A- ATCPP) have resulted in transgenic lines of Arabidopsis which have 
increased sensitivity to ABA which is indicative of stress tolerance. The data for all 4 
constructs is shown in Figure 8. Of the lines transformed with the pBIl 21 -AtCPP 
construct to over-express the AtCPP gene, 58% (21 out of 36) were classified as 

25 sensitive and an added 30% (1 1 out of 36) were classified as moderately sensitive. These 
lines were tested again in T4 and T5 generations and their ABA sensitivity was still 
present indicating that ABA sensitivity is an inheritable trait. Of the lines transformed 
with the pBI121-HP-AtCPP construct to down-regulate the AtCPP gene by double 
stranded RNA-inhibition, 15% (7 out of 45) were classified as sensitive and 31% (14 out 

30 of 45) were classified as moderately sensitive. To illustrate the increased sensitivity of 
transgenic lines to ABA, Figure 9 shows the results of germination and seedling 
development over a range of ABA concentrations. Wild type and pRD29A-HP- AtCPP 
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are compared. Of the lines transformed with pRD29AHp-AtCPP 70% (12 out of 17) 
showed high sensitivity and 24% (4 out of 1 7) showed moderate sensitivity to ABA. Of 
the lines transformed with pRD29A-AtCPP 29% (5 out of 17) showed high sensitivity 
and 12% (2 out of 17) moderate sensitivity to ABA. Clearly all 4 transgene constructs 
5 are altering ABA sensitivity and ABA signal transduction. 

Example 11: Drought Experiments 

Arabidopsis plants were grown five plants per 4" or 3" pot, in a replicated water- 
stress experiment. All pots were filled with equal amounts of homogeneous premixed 
and wetted soil. Plants were grown under 16 hour daylight (150-200 jamol/m 2 /s) at 22 °C 

10 and 70% relative humidity. On the day that the first flower opened drought treatment was 
initiated. First soil water content in each pot was equalized on a weight basis and any 
further watering of plants was stopped. Daily measurements of soil water content were 
taken by recording total pot weight. At the end of the drought treatment (6 to 9 days for 
experiments in 4" pots and 4-5 days for experiments in 3 " pots) plants were harvested 

15 and shoot dry weights determined. Differences in plant growth were factored into the 
analysis by expressing water loss on a per gram shoot dry weight basis. 



11a) pBI121-AtCPP, Drought stress screen: 

Analysis of pBI121-AtCPP transgenic lines during water-stress treatment 
experiments of up to an eight day period, shows a strong trend towards increased soil 

20 water content and reduced water loss per gram of shoot biomass. After three days of 
water-stress treatment most lines had increased soil water content relative to the wild 
type control with four out of twenty- four lines, 146, 149, 156 and 97, showing a 
statistically significant difference. The amount of water lost per gram of shoot biomass 
was lower for all lines except one (95), and thirteen of these lines were significantly 

25 different from the wild type Columbia control (Figure 1 0). All of the lines showing a 
statistically significant lower water loss per gram shoot biomass also showed an 
increased ABA sensitivity. There is also a strong trend, for all but one line (95), which is 
ABA Wt , towards greater shoot biomass at the end of the drought stress treatment. Seven 
of those lines 136, 146, 23, 46, 76, 84 and 9, were statistically significant from control at 

30 a p=0.05 value. 
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lib) pBI121-AtCPP, Water loss per gram shoot biomass during water stress 
treatment: 

Lines 35, 76, 95 and a wild type control were grown and placed under a water- 
stress treatment as above. Plants were harvested at 2 days, 4 days and 6 days of drought 
5 treatment. The ABA S lines, 35 and 76, showed a statistically significant reduction in 

water-loss relative to shoot dry weight at all three time points (Table 9). Additionally, the 
two ABA S transgenic lines had increased shoot biomass, due to increased leaf biomass, 
and maintained higher soil water contents during drought treatment. 

Table 9. Water loss (g) per Shoot dry weight (g) after 2, 4 and 6 days of drought-stress 
10 treatment. Values in bold indicate statistically significant differences from Columbia. 





2 days 


4 days 


6 days 


Line 


Mean 


Std. Error 


Mean 


Std. Error 


Mean 


Std. Error 


35 


212.5 


3.5 


308.0 


9.9 


297.7 


11.2 


76 


227.2 


5.8 


321.2 


8.5 


293.8 


5.0 


95 


287.0 


5.1 


377.3 


14.8 


348.5 


25.5 


Columbia 
Wild type 


265.3 


11.8 


408.2 


7.7 


345.9 


6.7 



lie) pBI121-AtCPP, Drought stress and shoot recovery: 

Water-stress tolerance and determination of post drought-treatment recovery 
ability was assessed using 20 of the 24 pBI121-AtCPP transgenic lines. Drought 

15 treatment was imposed for 6 days after which the plants were watered and allowed to 
grow for 6 days. Recovered shoot fresh biomass was then determined. Soil water 
content of these plants was measured daily during the drought treatment and the results 
confirm previously seen trends. All ABA sensitive (ABA S ) lines that showed a 
statistically significantly reduction of water loss on a per gram dry weight basis in 

20 experiment 11a, continued to show a significant greater soil water content than control 
plants in this experiment (Table 10). Additionally, Table 10 shows that the recovered 
shoot fresh biomass after 6 days of drought treatment was significantly greater in all the 
ABAs lines than Columbia. 
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TablelO. Soil water content on day 3 of drought treatment and recovered shoot fresh 
weight after 6 days of drought treatment (values in bold were significantly different from 
Columbia at p=0.05) 





ABA status 


soil water content day 3 


recovered shoot biomass 


Line 


ABA 


Mean (% 
initial) 


Std Error 


Mean (g) 


Std Error 


136 


ABA S 


46.6 


1.9 


4.5 


0.16 


14 


ABA S 


50.25 


0.7 


4.1 


0.12 


146 


ABA S 


45.9 


2.5 


4.0 


0.11 


147 


ABA 


45.1 


1.7 


4.0 


0.15 


149 


ABA 


45.3 


1.8 


3.8 


0.17 


156 


ABA 


47.1 


1.9 


4.0 


0.134 


23 


ABA S 


49 


1.4 


4.0 


0.17 


33 


ABA* 


46.9 


1.6 


4.3 


0.14 


35 


ABA* 


41.7 


1.7 


4.0 


0.11 


46 


ABA* 


44.8 


1.7 


3.8 


0.09 


63 


ABA* 


46.3 


1.4 


4.0 


0.19 


76 


ABA* 


47.8 


1.0 


3.9 


0.17 


79 


ABA* 


45.4 


1.1 


4.1 


0.09 


84 


ABA* 


46.8 


1.9 


4.1 


0.16 


85 


ABA* 


45.3 


1.9 


4.0 


0.12 


9 


ABA 


45.2 


2.1 


3.9 


0.12 


93 


ABA wt 


43.5 


1.2 


2.8 


0.07 


94 


ABA S 


46.9 


1.5 


3.9 


0.13 


97 


ABA S 


53 


1.2 


3.8 


0.16 


95 


ABA wt 


41.9 


1.2 


2.7 


0.06 
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Columbia 


ABA Wt 


41.3 


1.0 


2.7 


0.04 



lid) pBI121-AtCPP, Seed yield after drought stress treatment: 

Seed yield after drought stress during flowering was examined using ten pBI121- 

5 AtCPP transgenic lines, eight of which were ABA S . Plants were grown one per 4" pot 
and were exposed to 9 days of drought treatment as described above. A second group of 
plants was grown and maintained under well watered conditions as the optimal group. 
After 9 days of drought treatment plants were re-watered and allowed to continue growth 
and seed set to maturity. After drought-treatment conditions all eight ABA S lines had 

10 increased yields relative to controls, which ranged from 109% to 126% of the Columbia 
(Table 1 1). Drought-treatment resulted in a reduction of yield in all lines, including 
controls, relative to plants grown under optimal conditions. Expression of the seed yields 
obtained from drought-treated group relative to the same line under optimal conditions 
shows that the transgenics preserve a larger percentage of optimal seed yield than do 

15 wild type lines. 



Table 11. Seed Yield following 9 days drought-treatment 





ABA status 


Seed Yield (g per plant) 






Line 


ABA 


Mean (g) 


Std Error 


% Columbia 


% Optimal 


156 


ABA S 


0.735 


0.044 


126.2 


83.7 


63 


ABA S 


0.675 


0.061 


116.0 


71.0 


146 


ABA S 


0.666 


0.053 


114.4 


72.9 


94 


ABA S 


0.644 


0.052 


110.6 


68.8 


84 


ABA 54 


0.642 


0.049 


110.4 


61.8 


76 


ABA S 


0.631 


0.055 


108.5 


66.6 


136 


ABA S 


0.630 


0.051 


108.3 


74.1 


35 


ABA S 


0.614 


0.054 


105.6 


74.2 
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93 


ABA Wt 


0.567 


0.041 


97.5 


60.0 


95 


ABA Wt 


0.388 


0.088 


66.7 


43.4 


Columbia 


ABA wt 


0.582 


0.060 


100 


53.8 



lie) pBI121-AtCPP, Seed yield and growth under optimal water conditions: 

The lines evaluated above and a number of additional lines were examined in a 
5 growth and yield experiment under optimal, well-watered conditions. Results indicated 
that the ABA lines were shorter at the stage of first open flower, had more rosette 
leaves, however, by maturity there were no differences in plant height of transgenics and 
Columbia. Moreover, the ABA S transgenics showed similar or higher seed yields ranging 
from 95% to 121% of the wild type control (Figure 1 1). 

10 llg) pRD29A-HP-AtCPP screen for drought tolerant phenotype: 

Analysis of 17 transgenic lines identified 7 candidate drought tolerant lines (12, 22, 23, 
47, 82, 83, 90) on the basis of higher soil water content and lower water loss per g of 
shoot dry weight (Table 12). All 7 drought tolerant candidate lines showed strong ABA 
sensitivity and lines that did not show drought tolerance did not show ABA sensitivity. 

15 Table 12. Soil water content after 3 days of drought treatment and water lost per g shoot 
dry weight. Values in bold are statistically different from those of Columbia wild type 
(p=0.05) 





ABA status 


soil water content day 2 


water lost in 2days/g 
shootDW 


Line 


ABA , 


Mean (% 
initial) 


Std Error 


Mean (g/g) 


Std Error 


10 


ABA S 


33.4 


1.6 


199.1 


4.5 


11 


ABA S 


34.6 


3.3 


173.1 


1.6 


12 


ABA S 


36.2 


2.0 


179.5 


5.0 


126 


ABA MS 


32.5 


2.6 


199.1 


4.1 
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127 


ABA MS 


33.5 


2.0 


195.6 


10.6 


14 


ABA 5 


32.7 


1.2 


203 


4.9 


17 


ABA S 


29.9 


1.8 


200.7 


7.3 


22 


ABA 5 


39.3 


2.1 


170.0 


3.0 


23 


ABA 5 


35.7 


1.4 


174.9 


2.6 


42 


ABA M5 


28 


0.7 


185.4 


5.8 


47 


ABA 5 


35.9 


2.2 


181.2 


7.7 


7 


ABA Wt 


35 


1.3 


201.8 


5.1 


82 


ABA 5 


36.7 


2.2 


178.3 


4.0 


83 


ABA 5 


40 


1.4 


180.7 


6.9 


9 


ABA 5 


31.4 


1.4 


173.8 


8.7 


90 


ABA 5 


38.2 


1.3 


177.6 


6.2 


93 


ABA Wt 


30.7 


1.8 


175.3 


4.6 


Columbia 


ABA wt 


32.1 


1.2 


196.9 


6.2 



Example 12. Growth Analysis 

The growth analysis of most promising constructs has been set up at 3 stages. 
Eight plants per line were grown in 3" pots with one plant per pot at 22C, 1 6hr light 
5 (150-200 jamol/m 2 /s) and 70% RH. Plants were harvested at vegetative growth stage (2 
week old seedlings), bolting growth stage (at first open flower) and mid-flowering 
growth stage (5 to 7 days from first open flower). Also, in some growth experiments 
additional group of plants was grown in 4" pots (one per pot and 10 plants per line) to 
maturity for seed yield determinations. 

10 12a) pBI121-AtCPP growth under optimal and biotic stress conditions 

The growth and productivity of pBI121-AtCPP transgenic Arabidopsis lines was 
examined at several stages of development under optimal growth conditions. Although 
optimal growth conditions were maintained, plants were assessed to be under a degree of 
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stress that was later determined to be a result of the soil properties. Soil analysis found a 
fungal contaminant that was believed to be responsible for the biotic stress. This stress 
could be negated by sterilization of the soil prior to use. Eight ABA S lines, two with 
normal ABA sensitivity (ABA wt ) and a wild type Columbia control were analyzed. 
5 Figure 12 presents the results of various growth (from mid-flowering stage) and 

yield parameters and each trait is expressed as a percentage of the Columbia control. 
The results strongly support an enhanced growth phenotype. This enhanced growth 
phenotype is present at all growth stages. At the vegetative stage, all ABA S transgenic 
plants showed an increase in leaf number relative to that of the wild type with four of the 

10 eight lines showing a statistically significant difference. The two ABA wt lines showed 
the same or fewer leaves relative to wild type. 

At the bolting stage ABA S transgenics showed an increase in leaf number but 
plants were shorter at this stage (first open flower) than controls. The shoot fresh weight 
of transgenics was significantly increased relative to that of controls, ranging from 80% 

15 to 342% of the wild type. The ABA S transgenics displayed a delay in flowering from one 
to three days. The ABA wt transgenics did not show delayed flowering, increased shoot 
fresh weight or increased height. 

At the flowering stage of development the enhanced growth phenotype is 
maintained (greater leaf number and fresh weight), however, there were no observable 

20 differences in plant height indicating that transgenics bolt shorter but reach same final 
plant height. 

Of particular significance is the observation, that under these conditions (biotic 
stress due to presence of fungi in the soil) yields of the ABA S transgenics were 
significantly higher, ranging from 120% to 229% of the wild type control. The ABA wt 

25 lines showed similar or slightly reduced yields relative to the Columbia control. This 
finding indicates that ABA S transgenic lines are affected less by the biotic stress. This 
observation has been confirmed, where 5 of the drought tolerant lines were grown in 
contaminated soil to maturity. The seed yields of transgenic lines, even though greatly 
reduced relative to optimal conditions, were 2.5 to 4.5 fold higher than those of 

30 Columbia wild type (Table 13). 

Table 13. Seed yield of pBI121-AtCPP lines grown in contaminated soil. Values in 
bold indicate statistical differences at p=0.05 
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Line 


ABA 

sensitivity 


Seed Yield per plant 

(g) 


% of Columbia 


156 


ABA S 


0.33 ± 0.04 


316% 


23 


ABA S 


0.35 ± 0.05 


336% 


76 


ABA S 


0.31 ± 0.04 


296% 


84 


ABA S 


0.25 + 0.33 


237% 


9 


ABA 5 * 


0.48 ± 0.05 


455% 


Columbia 


ABA wt 


0.11 ±0.03 





12b) pBI121-AtCPP early seedling growth: 

Four ABA S and one ABA wt line plus Columbia were examined for early seedling 
5 growth on agar plates. Twenty seeds were plated in a line on agar plates containing 50% 
MS with 1% sucrose and vitamins and 6 plates per line were used. Plates were placed on 
slants, which allowed roots to grow downwards. Root length was measured on 7-day old 
seedlings and shoot and root biomass determined on 1 1-day old seedlings. Two of the 
ABA S transgenic lines had significantly longer roots and all 4 ABA S lines had shoot dry 

10 weights 1 14% to 123% of controls and root dry weights of 116% to 151% of controls. As 
a result, the shoot biomass to rootbiomass ratios were slightly reduced in transgenics. 
These results indicate that enhanced growth of these transgenics is evident in the early 
growth stage, shortly after germination, and the root growth is more enhanced relative to 
shoot growth. In a different experiment seedlings were pulled out of agar and roots were 

15 stained with toluidine blue to show their structure. Figure 13 shows that transgenic lines 
had more extensive lateral root system, which would account for greater root biomass. 

12c) pRD29A-HP-AtCPP optimal growth characteristics 

An optimal growth study has been conducted with 10 lines as described before. 
Vegetative growth data showed that two of the lines (12 and 9) had significantly more 
20 leaves and seven of the lines (12, 22, 23, 47, 82, 9) had significantly greater shoot 

biomass. Bolting data showed that eight of the lines (12, 22, 23, 47, 82, 9, 90, 93) were 
significantly delayed in flowering by one to two days, and seven of the lines were 
significantly shorter than Columbia at first open flower. All of the lines except 42 and 7 
had significantly greater number of rosette leaves and shoot FW and this trend is 
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maintained into the mid-flowering harvest (Figure 14). The plant height, however, by 
mid-flowering harvest was not significantly different between the transgenic lines and 
control. All the lines that showed this enhanced growth also showed drought tolerance 
and ABA sensitivity. 

5 

Example 13. Ultrastructure pBI121-AtCPP 

Two of the drought tolerant and ABA S lines (35 and 76) plus Wt Columbia were 
used to examine stem and root cross-sections for any differences in ultrastructure. Free 
hand sections of mature stems (plants flowering for lOdays) were obtained from above 

10 the first node, stained with toluidine blue and preserved with glycerol. The stems of 
transgenic plants appeared to have more dense cellular structure and contain one or two 
more vascular bundles than those of Columbia Wt indicating more enhanced water and 
nutrient transport system. 

Leaf disks were taken and fresh weights determined. Transgenic leaf disks were 

15 significantly heavier, 20-24% greater than corresponding wild type controls. This 
increase is believed to be as a result of a thicker leaf. 

Example 14. Cold stress experiment pBI121-AtCPP 

Four drought tolerant, ABA S lines (156, 23, 35, 76) and one ABA Wt (95) line plus 
wild type Columbia were included in a cold stress study. Plants were grown in 3" pots 

20 one per pot) with 10 replicate pots per line at 22C for 10 days (7 days on agar plates and 
4 in soil). The cold stress group was moved into 7°C for 5 days while the optimal group 
was left at 22C. After 5 days in the cold both cold stress group and the optimal group 
were harvested for shoot biomass determination. ABA and drought tolerant lines had 
significantly greater shoot biomass than Columbia in both optimal (25 to 39% greater 

25 shoot fresh weight) and cold stress groups (18 to 44% greater shoot DW) (Table 14). 
Results of an eight-day cold stress showed that differences between the transgenic lines 
and Columbia were even more pronounced (53 to 61% greater shoot fresh weight). This 
result indicates greater plant vigor and better ability of transgenics to cope with cold 
stress. 



Table 14. Shoot fresh weight of optimal and cold stressed (5C for 5d) pBI121-AtCPP. 
Values in bold indicate statistical difference at p=0.05 
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Line 


ABA 
sensitivity 


Optimal shoot FW 


Cold stress shoot FW 






mg 


% oi Columbia 


mg 


% ot Columbia 


156 


ABA S 


95 A ± 3.7 


1 J / /o 




1 1 ftoz 
1 lo/o 


23 


ABA 5 


OK 1 -4- 1 O 






1 44% 


35 


ABA 5 


87.0 ± 1.7 


125% 


25.3 1.4 


130% 


76 


ABA 5 


94.7 ± 2.2 


136% 


27.3 1.5 


140% 


95 


ABAWt 


67 ± 2.4 


96% 


21.4 1.0 


109% 


Columbia 


ABAWt 


69 ± 1.9 




19.6 1.1 





Example 15. Drought stress under high temperature pBI121-AtCPP 

A drought stress experiment was conducted as described above except that day 
temperature of 32°C (16hr) and night temperature of 22°C (8hr) was maintained. These 

5 temperatures were achieved daily over a 2hr ramping period. Four ABA S and one 

ABA Wt line plus Columbia were included. Plants were monitored daily for water loss and 
soil water content and after 5 days of drought treatment half of the plants were harvested 
and the other half was re- watered and allowed to recover for four days. Shoots were 
harvested and shoot fresh weight determined. The results (Table 15) of this experiment 

10 showed that previously identified drought tolerant lines maintained their drought tolerant 
phenotype at high temperature and were able to recover well from the drought stress at 
high temperature 

Table 15. Soil water content on day 2 and water lost in 2 days/final shoot dry weight 
15 plus recovery shoot FW after 5days of drought stress at 32C day and 22C night 

temperatures. Values in bold indicate significant differences from the Columbia control. 



line 


ABA 
sensitivity 


soil water 
content day 2 


water lost in 
2d/shoot DW 


recovered shoot 
FW (g) 


136 


ABA S 


50.4 ± 1.1 


485.7 ± 18.5 


1.30 ± 0.04 


146 


ABA* 


52.1 ± 1.0 


504.5 + 7.9 


1.15 ± 0.04 


35 


ABA S 


52.2 ± 0.8 


502.8 ±15.8 


1.19 ± 0.02 


76 


ABA 5 


52.1 ± 0.6 


435.6 ± 10.5 


1.11 ±0.03 
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95 


ABAWt 


50.0 ± 0.9 


518.2 ± 13.0 


0.86 ±0.03 


Columbia 


ABAWt 


48.6 ± 0.6 


559.7 ± 19.0 


0.84 ± 0.03 



Example 16. Heat stress and seed yield pBI121-AtCPP 

Two ABA S lines and one ABA Wt line plus Columbia were examined for the effect 
of heat stress during flowering on the final seed yield. Plants were grown in 4 inch pots 
5 (one/pot) as described above and 9 days from first open flower the temperature was 

ramped from 22 C to 43 C over 2 hours and plants were kept at 43 C for 2hr. Temperature 
was then ramped back to 22C over 2 hours and plants were grown under optimal 
conditions until maturity. The seed yields from this experiment are shown in Table 16. 
One of the drought tolerant lines (35) had significantly greater yield than Columbia. 

10 



Table 16. Seed yield of pBI121-AtCPP lines after two hour 43C heat stress 9 days from 
first open flower. Values in bold are statistically significant from Columbia. 



line 


ABA 
sensitivity 


seed yield (g/plant) 


seed yield (% of col.) 


35 


ABA S 


0.55 ± 0.05 


347% 


76 


ABA S 


0.24 ± 0.03 


148% 


95 


ABAWt 


0.11 ±0.02 


69% 


Columbia 


ABAWt 


0.16 ±0.03 





The effect of heat shock on lines of pBI121-AtCPP at the early flowering stage 
15 was assessed. Three ABA S lines (76, 136, 97) a ABA Wt line (95) and a Columbia wild 
type control were seeded in 128 cell flats, one flat per line. At the early flowering stage 
flats were exposed to a temperature of 46.8°C for 50 minutes and then returned to normal 
growth conditions. Lack of continued growth from main meristems was defined as main 
meristem death and scored for each line. Data is shown in Table 17. 

20 



Table 17. Meristem death due to heat shock 



Line 


Wt 


95 


76 


136 


97 


% Death 


91 


97 


79 


59 


18 
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Example 17. Stomata density determinations pBI121AtCPP 

Two ABA S lines (76 and 35) plus Columbia were examined for stomata density 
on the upper and lower leaf surface. Nail polish imprints of the upper and lower 
5 epidermis were obtained from a fully expanded leaf #5. These imprints were analyzed 
under the microscope and the number of stomata per 8.7 x 10" m were counted. There 
were no significant differences found between transgenics and Columbia in the stomata 
of the upper or lower epidermis (Table 18). The increases seen in drought tolerance and 
reduced water loss is not attributable to a reduced number of leaf stomata. 

10 



Table 18. Stomata numbers per 8.7 x 10 m of abaxial and adaxial epidermis of fully 
expanded leaf #5 in pBI121 AtCPP. 



line 


ABA sensitivity 


stomata on upper 
epidermis 


stomata on lower 
epidermis 


35 


ABA 5 * 


68 + 5 


103 + 7 


76 


ABA 5 


58 ±6 


120+16 


Columbia 


ABAWt 


57 ±6 


116+11 



Example 18. CPP Consensus Sequences 

15 Also included in the invention is the CPP consensus sequences. The consensus 

sequences were generated by alignment of the CPP polypeptide and nucleic acid 
ssequences as well as sequences homogous using the program BioEdit. 

The "x" in the consensus sequence represents any amino acid or nucleotide. 
Preferably "x" a conservative amino acid or nucleotide substitution. More preferably, 
20 "x" is the most amino acid or nucleotide most prevalent at a given postion. For example, 
the amino acid at postion 145 of SEQ ID NO: 73 is a proline as it occurs 66% of the 
time. 

Table 19. ClustalW Analysis of BASF Nucleic Acids 

1) basf_ati (SEQIDNO:21) 

2) basf_at2 (SEQIDNO:23) 

3) BASF-corn (SEQ ID NO:25) 

4) HASF-soy (SEQ ID NO:27) 
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5) Consensus (SEQ ID NO:68) 



10 20 30 40 50 60 

| | | | | | I | I I I I 

BASF_AT1 1 

BASF_AT2 1 

BASF-Corn 1 

BASF- Soy CTAATACGACTCACTATAGGGCAAGCAGTGGTAACAACGCAGAGTACGCGGGGGGAGACG 60 

Consensus XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 60 

70 80 90 100 110 120 

| | | | | | | | | | | I 

BASF_AT1 1 

BASF_AT2 1 

BASF-Corn 1 

BASF-Soy CATGGTTCTGAACTAATTGTTATAAATAATACCTAAAATTTTGAGTTGTCCTAAACATTG 120 

Consensus XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 120 

130 140 150 160 170 180 

j | | j | | | | j | | | 

BASF_AT1 1 

BASF_AT2 1 

BASF-Corn 1 

BASF- Soy GGGTTTAAACAAATCCAATCTCTCAATATAAAACCCAATGATCTCACCCTCACTCCGTTT 180 

Consensus XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 180 



190 200 210 220 230 240 

I | | I I I I I I t ♦ . . . I I 

BASF_AT1 ^pJjEAH 8 

BASF_AT2 ^E-Bfl-ftU 8 

BASF-Corn ________ i 

BASF-Soy CTGATTTCTCACTCTTCGTTTCTCGTTCGGTTCATCAGCGTGTGTCTCAGCC ^^^^ Tfl 24 0 

Consensus XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 240 



250 260 270 280 290 300 

| I I I I I I • . • . I I I ■ . . . I I 

BASF_AT1 g|fflTnT^l|p 68 
BASF_AT2 Bfl TMT fljB gl ^ 68 

BASF-Corn 1 

Consensus XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 300 



BASF_AT1 
BASF_AT2 
BASF-Corn 
BASF- Soy 




330 
. • I ■ . 



340 



350 



CTCAAgCTTCCgACTCT 

ctcaaBcttccBJactct 



CgA^ggAT^GG^Cg 



360 

I 



128 
128 
1 

360 



Consensus XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 3 60 



BASF_ATl 
BASF_AT2 
BASF-Corn 
BASF- Soy 



370 
. . | . . 



380 



390 



400 



410 



AGCC AAG AGAA gTTTGAGAAATC 
AGCCAAGAGAA gTTTGAGAAATC 



AAAAGC 
AAAAGC 



^32__E__ T 



420 



188 
188 
1 

420 



Consensus XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 420 



BASF_AT1 
BASF_AT2 
BASF- Corn 
BASF-Soy 
Consensus 



430 



440 
. . | . . 



450 



460 



470 
. . I . ■ 



GACTCT 
GACTCT 



:TTTGGGi 

:tttggg' 



480 
I 



xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 



248 
248 
1 

480 
480 



BASF_AT1 
BASF_AT2 
BASF-Corn 
BASF-Soy 



490 



TGGTTTTGGAAGA 
TGGTTTTGGAAGA 



500 510 520 

_!__•__• I __■__• l • 

GTTl 
GTT 

Acl 



rGGTTTTGGAAG 



Consensus XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 



550 



560 



570 



580 




540 
. . I 



GAGAA 
GAGAA 
GAGAA 
GAGAA 
GAGAA 



308 

308 

20 

540 

540 



600 
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BASF_ ATI 

BASF_AT2 

BASF-Corn 

BASF-Soy 

Consensus 



BASF_AT1 
BASF_AT2 
BASF- Corn 
BASF- Soy 
Consensus 



BASF__AT1 

BASF_AT2 

BASF-Corn 

BASF-Soy 

Consensus 



BASF_AT1 
BASF_AT2 
BASF- Com 
BASF- Soy 
Consensus 




BASF_AT1 
BASF_AT2 
BASF-Corn 
BASF- Soy 
Consensus 




790 
. . I . ■ 



800 



810 



820 



830 



TGGG^HTT 

tgggSHtt 
tgggSJtt 

TGGGOTTT 
TGGG^STT 



840 




BASF_AT1 
BASF_AT2 
BASF-Corn 
BASF- Soy 
Consensus 



850 



860 



870 



880 



890 



900 




910 



920 



930 



940 



950 
. . | . . 



960 



BASF_AT1 

BASF_AT2 

BASF-Corn 

BASF-Soy 

Consensus 



BASF_AT1 
BASF_AT2 
BASF-Corn 
BASF- Soy 
Consensus 



BASF_AT1 
BASF_AT2 
BASF-Corn 
BASF- Soy 
Consensus 



BASF_AT1 

BASF_ J AT2 

BASF-Corn 

BASF-Soy 

Consensus 
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BASF_AT1 
BASF_AT2 
BASF-Corn 
BASF- Soy 
Consensus 



BASF_AT1 
BASF_AT2 
BASF-Corn 
BASF- Soy 
Consensus 



BASF_AT1 
BASF_AT2 
BASF-Corn 
BASF- Soy 
Consensus 



BASF_AT1 

BASF_AT2 

BASF-Corn 

BASF-Soy 

Consensus 



BASF_AT1 
BASF_AT2 
BASF-Corn 
BASF- Soy 
Consensus 



BASF_AT1 
BASF_AT2 
BASF-Corn 
BASF- Soy 
Consensus 



BASF_AT1 

BASF_AT2 

BASF-Corn 

BASF-Soy 

Consensus 



BASF_AT1 
BASF_AT2 
BASF-Corn 
BASF- Soy 
Consensus 




XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1500 
1510 1520 1530 1540 1550 1560 



I 



I 



I 



| | I | | | | | . 

SMttHA 127 5 

^^TtBa 127 5 

IS9Sa gBtTAGTCGATCCTTGTATGAGGTTTACATATGGATTTTTCCCTGCCACATGCACA 1040 
1434 

XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 15 60 

1570 1580 1590 1600 1610 1620 

I | I | I I I I I | | | 

12 75 

12 75 

CCGATTCAGTGCTTGGATGGTGAGGGTTTTGACATAGGAGTGTTGTCAAAGCTTTAGAGT 1100 
X434 

XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1620 



I 



1630 
• . I I 



1640 
. . I . . . 



1650 



1660 



I 



1670 
. . I . . . 



1680 



BASF_AT1 1275 

BASF_AT2 1275 

BASF-Corn GCATCTTTCGGTCAGGTGCAACAGCCTTTCGGTCATTGAGACATATAAGCGAATTAGCTA 1160 

BASF-Soy 1434 

Consensus XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1 680 



BASF_AT1 
BASF AT2 



1690 
I | I 



1700 



1710 



1720 
• • I 



1730 
. . I | , 



158 



1740 
. - I 

— 1275 
--- 1275 
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BASF-Corn TTAAAAAAAACAGAACTGTTGCATCAAAAAAAAAAAAAAAAAAGAAACAAAAAAAAAAAA 1220 

BASF- Soy 1434 

Consensus XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1740 

1750 1760 1770 1780 1790 1800 

I | | | | | I I I I I I 

BASF_AT1 1275 

BASF_AT2 1275 

BASF-Corn AAAAAAAAAAAAGAAAAAAAAAAAAAAAAAAAAAGTGCTCTGCGTTGTTACCACTGCTTG 1280 

BASF-Soy 1434 

Consensus XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1800 

1810 1820 

I | | | . 

BASF_AT1 1275 

BASF_AT2 1275 

BASF-Corn CCCTATAGTGATCGTATCAGA 1301 

BASF-Soy 14 34 

Consensus XXXXXXXXXXXXXXXXXXXXX 1821 



Table 20. ClustalW Analysis of BASF Amino Acids 

1) basf_ati (SEQ ID NO:22) 

2) basf_at2 (SEQIDNO:24) 

3) BASF-corn (SEQ ID NO:26) 

4) BASF-soy (SEQIDNO:28) 

5) Consensus (SEQ ID NO:69) 



BASF_AT1 
BASF_AT2 
BASF-Corn 
BASF- Soy 
Consensus BASF 



10 



20 
• I • - 



30 



40 



50 



60 



MEgVVGFMlgMYIFETYLDraRCMALKLPTLPKTLfflGVISQEKFEKSRAYSLDKS 

meBvvgfmiImyifetyldSrcIBalklptlpktlmgvisoekfeksraysldks 



/VGFMIHMYI FETYLDgRC 



XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 



60 
60 

1 

60 
60 



BASF_AT1 
BASF_AT2 
BASF-Corn 
BASF- Soy 




FHFVHEFVTI 



Consensus BASF XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 



BASF_AT1 
BASF_AT2 
BASF-Corn 
BASF- Soy 
Consensus BASF 



I 



130 



I 



140 
■ . I ■ . 



150 



I 



I 



160 



170 



I 



180 
■ - I 



ITDLPFSLYSTFVIE|RHGFNKQTIwaSFIRDMIKGHFLSVILGPPIVAAIl|jIVQKGGPY 

itdlpfslystfvie|rhgfnkqtiw1firdmikg !flsvilgppivaaii|ivqkggpy 
itdlpfslystfvie|rhgfnkqtiwIfirdmikg JJlsiJilgppivaaiiwivcBggpy 
itdlpfslystfvieSrhgfnkqtHwHfBrdmHkg JflsviBgppivaaiiSivqkggpy 

ITDLPFSLYSTFVIE8RHGFNKQT"w8F"RDM°KG2telLS^I°GPPIVAAIl8llVC3GGPY 



BASF_AT1 
BASF_AT2 
BASF-Corn 
BASF- Soy 
Consensus BASF 



BASF_AT1 
BASF_AT2 
BASF-Corn 
BASF-Soy 
Consensus BASF 



BASF ATI 
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BASF_AT2 
BASF-Corn 
BASF- Soy 
Consensus BASF 



BASF_AT1 
BASF__AT2 
BASF-Corn 
BASF- Soy 
Consensus BASF 




420 
420 
324 

:SC 400 

<XXXXXXXXXXXXXXXXXXXXXXX 420 



430 440 450 460 470 480 

I I I I I I I I I I I I 

BASF_AT1 KBTg 424 

BASF_AT2 KraTg 424 

BASF-Corn dBkIdSILVGLHMDFSLPHAHRFSAWMVRVLTECCQSFRVHLSVRCNSLSVIETYKRISY 384 

BASF- Soy 4 00 

Consensus BASF XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 4 80 



490 500 510 520 530 540 

I | | | | ! | | | | | | 

BASF_ATl 424 

BASF_AT2 424 

BASF-Corn KKQNCCIKKKKKKETKKKKKKKKKKKKKKKVLCVVTTACPIVIVS 429 

BASF- Soy 400 

Consensus BASF XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 525 



550 560 570 580 590 600 

| | | | | | | | I I I I 

BASF_AT1 424 

BASF_AT2 424 

BASF-Corn 429 

BASF-Soy 400 

Consensus BASF 525 



610 620 630 640 650 660 

I | | | | I I I I I I I 

BASF_AT1 424 

BASF_AT2 424 

BASF-Corn 429 

BASF- Soy 400 

Consensus BASF 525 



670 680 690 700 710 720 

| | | | | | | I | | I .... I 

BASF_ATl 424 

BASF_AT2 424 

BASF-Corn 429 

BASF-Soy 400 

Consensus BASF 525 



730 740 750 760 770 780 

I | | | | | | | I I I | 

BASF_AT1 424 

BASF_AT2 424 

BASF-Corn 429 

BASF-Soy 4 00 

Consensus BASF 525 



790 800 810 820 830 840 

| | | | | | | | ) | | | 

BASF_AT1 424 

BASF_AT2 424 

BASF-Corn 429 

BASF-Soy 4 00 

Consensus BASF 525 



850 860 870 880 890 900 

| | | | | | | | | | | | 

BASF_AT1 424 

BASF_AT2 424 

BASF-Corn 429 

BASF-Soy 4 00 

Consensus BASF 525 



160 
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910 920 930 940 950 960 

| | | | | | | I I I I | 

BASF_AT1 424 

BASF_AT2 424 

BASF-Corn 429 

BASF- Soy 400 

Consensus BASF ; 525 

970 980 990 1000 1010 1020 

I | I I | | I I I ! I I 

BASF_AT1 424 

BASF_AT2 424 

BASF-Corn 429 

BASF- Soy 400 

Consensus BASF 525 

1030 1040 1050 1060 1070 1080 

I | | | | | | j I I | .... I 

BASF_AT1 424 

BASF_AT2 424 

BASF- Corn 429 

BASF-Soy 4 00 

Consensus BASF 525 

1090 1100 1110 1120 1130 1140 

| | | | | I I I I I I .... I 

BASF_AT1 424 

BASF__AT2 424 

BASF-Corn 429 

BASF-Soy 400 

Consensus BASF 525 

1150 1160 1170 1180 1190 1200 

| | ! | | | | | | | | ! 

BASF_AT1 424 

BASF_AT2 424 

BASF-Corn 429 

BASF-Soy 400 

Consensus BASF 52 5 

1210 1220 1230 1240 1250 1260 

| | | | | | | j | | | I 

BASF_AT1 424 

BASF_AT2 424 

BASF- Corn 429 

BASF- Soy 400 

Consensus BASF 525 

1270 1280 1290 1300 1310 1320 

I | | | | | | | I | | | 

BASF_ATl 424 

BASF_AT2 424 

BASF-Corn 429 

BASF-Soy 400 

Consensus BASF 525 

1330 1340 1350 1360 1370 1380 

I | | | | I j I I I I I 

BASF_AT1 424 

BASF_AT2 424 

BASF-Corn 429 

BASF-Soy 400 

Consensus BASF 525 



Table 21. ClustalW Analysis of Generic Nucleic Acids 



1) afci (SEQIDNO:29) 

2) AT4g0i320 (SEQ ID NO:3 1 ) 

3) AF007269 (SEQ ID NO:33) 

4) Consensus (SEQ ID NO:70) 



10 20 30 40 50 60 

161 
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— i . ; . . i — i — i — t — i — i — i — i — i — i — i 

afcl 1 

AT4g01320 1 

AF007269 ATGGCGATTCCTTTCATGGAAACCGTCGTGGGTAAGCTTCAAAACCTTTTTCTGAGACAT 60 

Consensus XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 6 0 

70 80 90 100 110 120 

I | I | I | j | I | | | 

afcl 1 

AT4g01320 1 

AF007269 TTTACTATCCTGTTTCACTCATCGTATTTCGTTTTTGTTTGGGTTTTGCTTTCTGTGTTG 120 
Consensus XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 120 

130 140 150 160 170 180 

| | | | | | | | | | | | 

afcl 1 

AT4g01320 1 

AF007269 TGTGTGTTGAGATTCCATGACTCGTTTGTTTCATATACCATCGTCTCTGCTTCTCGTTTC 180 
Consensus XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 180 

190 200 210 220 230 240 

I | I I I I I I I I I | 

afcl 1 

AT4g01320 1 

AF007269 TAAATTTTGTTCTTTTCTAATAGTGCGTACCTTGATCTGAGGTTTTATTACTCCTACTAG 240 
Consensus XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 240 

250 260 270 280 290 300 

I | | | I | | | | | | | 

afcl 1 

AT4g01320 1 

AP007269 TTTCTTGTCTTACTCGTGCGTTTGATTTGATTTGAGCTTATGTGATTTCATCATCTCTTC 300 
Consensus XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 300 

310 320 330 340 350 360 

| | | | | | | | | | | | 

afcl 1 

AT4g01320 1 

AF007269 CTCGGTTTTAGAATGTACGGAGCTTCTCTGTTAACCAAAATCTAGGATTTGGGAAGAAAA 360 
Consensus XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 3 60 

370 380 390 400 410 420 

| | | | | | | | | | | | 

afcl 1 

AT4g01320 1 

AP007269 GTCGGAGTCTTTTTTTTCCTCATTCCCGATTGGAAATTGAGAATCTTGAAATTTTTCTTT 420 
Consensus XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 420 

430 440 450 460 470 480 

| | | | I | | | | | I | 

afcl 1 

AT4g01320 1 

AF007269 GTTCAAGTCATACAGCTTGAGGTTTTGGGTTTTCTTGTCAGGGTATTATTATGTTCGTGA 480 
Consensus XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 480 

490 500 510 520 530 540 

| | | | | | | | | | j | 

afcl 1 

AT4g01320 1 

AF007269 CTGCAACTAGAGTTTTCTGGAGTTTTTTGAAATGGGTTTTGTGTTGTGGAACCGTATGTG 540 
Consensus XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 540 

550 560 570 580 590 600 

| | | I I I I I I I I I 

afcl 1 

AT4g01320 1 

AF007269 AATGTTGCATCAAAACTCTTTCAGTGCTCCAATGTTTCCATCAGTAGTCAGCACAAGAGA 600 
Consensus XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 600 

610 620 630 640 650 660 

| | | | | | | | | | | | 

afcl 1 

AT4g01320 1 

AF007269 TCTTTTTATATCTGGTTGATCAAAAAAGTAGATGATGTTATTGAATTTTCAGTGATGGAG 660 
Consensus XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 660 

670 680 690 700 710 720 

162 
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— i — i — i — i — i — i . • - ^gggamam ' ' gMMm 25 

AF0072 69 TATCTGTTGTTGTGGCATTTAGAGTAGATTCGT^^^^^^^^GT^^^^^^^^TC 7 20 
Consensus XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXxSkxgkxxflxfflxxmXMXXXXXXXX 720 



afcl 

AT4gO1320 

AF007269 

Consensus 



I 



730 



I 



740 



I 



750 



760 



770 
. - I - . 



I 



780 



TCGTGGGTTTTATGATAGTGATGTACATTTTTGAGACGTATTTGGATCTGAGGCAACTCA 

t cgtg ggttttatgatagtgatgtacatttttgagacgtatttggatctgaggcaactca 
tjjjjwggttttatgatagtgatgtacatttttgagacgtatttggatctgaggcaactca 
t!SSB8ggttttatgatagtgatgtacatttttgagacgtatttggatctgaggcaactca 



85 
85 
780 
780 



afcl 

AT4gO1320 

AF007269 

Consensus 



I 



790 



I 



800 
. . I . . 



I 



810 



I 



820 
. . I . . 



I 



830 



840 



I 



I 



CTGCTCTCAAGCTTCCAACTCTCCCGAAAACCTTGGTTGGTGTAATTAGCCAAGAGAAGT 
CTGCTCTCAAGCTTCCAACTCTCCCGAAAACCTTGGTTGGTGTAATTAGCCAAGAGAAGT 
CTGCTCTCAAGCTTCCAACTCTCCCGAAAACCTTGGTTGGTGTAATTAGCCAAGAGAAGT 
CTGCTCTCAAGCTTCCAACTCTCCCGAAAACCTTGGTTGGTGTAATTAGCCAAGAGAAGT 



850 860 870 880 890 900 

I I I I • . . .1 I I I I I I I 

afcl 167 

AT4g01320 ^^^^^^^^^^^^^^ G 168 

AF007269 l^^^^^^^^^^^^^TCTTGACAAAAGGTTTCGTCTTGATCATATTTATATCA 900 
Consensus B^I^^^^^^^^^^BxXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 900 



910 920 930 940 950 960 
| | | | | | { | | .... | ... . I jl 

AF007269 ttttagttttttataattgccaggg !1$S^ 960 

Consensus XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXrajX^^^XXHgMll 9 60 



afcl 

AT4g01320 

AF007269 

Consensus 



I 



970 



I 



980 



I 



990 



I 



1000 

. . I . . . 



I 



1010 

. . j . . . 



t 



1020 



I 



ttttcactttgttcatgagtttgtaactatacttatggactctgcaattttgttctttgg 
ttttcactttgttcatgagtttgtaactatacttatggactctgcaattttgttctttgg 
ttttcactttgttcatgagtttgtaactatacttatggactctgcaattttgttctttgg 
ttttcactttgttcatgagtttgtaactatacttatggactctgcaattttgttctttgg 



1030 1040 1050 1060 1070 1080 

— I — 1 — I — I ■ . . .1 — I — I — I — I — I — I — I 
afci ^^^^^^^^^S^^3B 264 

AT4g01320 ^^^^^^^^^^^^3 285 

AF007269 ^S^^g^^^^^&i^ ^^ GTACATATCTGGTTTCGGTATACAGTATCTCATTTTGA 1080 
Consensus ^^^^^^^^^^^^^^XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1080 



1090 1100 1110 1120 1130 1140 

— i — i — i — i — i — i — i — i — i * • • HHgnBmBB 276 

AF007269 ATATAGAGTTGTTACATTACAATTGTAAAGTTTTCATTTTTACCTTAG^^^^^^^M 1140 
Consensus XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX^^^^^^^^ 1140 



afcl 

AT4g01320 

AF007269 

Consensus 



I 



1150 
. . I . . . 



1160 



1170 



I 



! 



1180 
. . I . . . 



I 



1190 1200 
. . I | | 



gttttaccgaggttgggccttgatccagagaatgaaatactgcatactctttcattcttc 
gttttaccgaggttgggccttgatccagagaatgaaatactgcatactctttcattcttg 
gttttaccgaggttgggccttgatccagagaatgaaatactgcatactctttcattcttg 
gttttaccgaggttgggccttgatccagagaatgaaatactgcatactctttcattcttg 



336 
357 
1200 
1200 



1210 1220 1230 1240 1250 1260 

— I — I — I — I — I — I — I — I — I — I — I — I 

afcl ^^^^^^P^^^^^^^S 360 

AT4gO1320 ^^g-^SS^^^^^^^S 38 1 

AF007269 L^^^M^^^^^^^ ^^^^ BgTGTTCCAAATAAACCCCTTCATATAGTCCTATACG 12 60 

Consensus iffi^^M^^^^SS^^^^XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 12 60 



1270 1280 1290 1300 1310 1320 

| | | | | | | | | | I I 

afcl 360 

AT4gO1320 381 

AF007269 TTTAGCATCAAAATATCTATTTTCTTAAGATAATAATATTTCTTTTATATTCTGATGCAG 1320 
Consensus XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1320 



1330 



1340 



1350 



1360 



1370 



1380 



163 
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afcl 

AT4g01320 

AF007269 

Consensus 



! 



i 



i 



i 



i 



I 



i 



i 



i 



ATCACTGATTTGCCATTTTCTTTGTACTCAACTTTCGTGATCGAGTCTCGGCATGGGTTC 
ATCACTGATTTGCCATTTTCTTTGTACTCAACTTTCGTGATCGAGTCTCGGCATGGGTTC 
ATCACTGATTTGCCATTTTCTTTGTACTCAACTTTCGTGATCGAGTCTCGGCATGGGTTC 
ATCACTGATTTGCCATTTTCTTTGTACTCAACTTTCGTGATCGAGTCTCGGCATGGGTTC 



420 
441 
1380 
1380 



1390 1400 1410 1420 1430 1440 

I . ■ . - I I I I I I I I I I I 

afcl IISE 426 

AT4g01320 SsiSH 447 

AFO 07 2 6 9 ^^^S GT ATGTCGTATTTCC AAC ACTACCTTGTGACTTACGTTTTTTTATCAGAGATGT 1440 
Consensus ^^^HxXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 14 40 



1450 1460 1470 1480 1490 1500 

I | | | | | . . ..1 j I I 1 | 

afcl ^^^^^^^^^^^^^i^^^ 454 

AT4g01320 ^^^^^^^^^^^^^^^^S 475 

AF007269 GGATTAAATTTGCTTCTAAATTCTGTTGAC ^ 1500 
Consensus XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXx BS**^^ 1500 



afcl 

AT4g01320 

AF007269 

Consensus 



1510 
I I . . . 



1520 
. . | . . . 



1530 1540 
. . I I I . . . 



1550 



I 



TGATCAAAGGAACATTCCTCTCTGTCATACTAGGCCCACCCATTGTTGCTGCGATAATTT 
TGATCAAAGGAACATTCCTCTCTGTCATACTAGGCCCACCCATTGTTGCTGCGATAATTT 
TGATCAAAGGAACATTCCTCTCTGTCATACTAGGCCCACCCATTGTTGCTGCGATAATTT 



:tctgtcatac 



1560 

514 
535 
1560 
1560 



1570 1580 1590 1600 1610 1620 

I I ■ ■ • • I I I I I I I I i i 

afcl WSMMSl^ 525 

AT4g01320 R^^^^^^ 54 6 

AFO 072 69 ^^^^^^^GTTTGATGATTCTGGATTCATCTTATTTCTGAGTTTTTCACATGGATGA 1620 
Consensus H^M^^^^XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1620 



1630 1640 1650 1660 1670 1680 

| | | | | | | j | | | | 

afcl 525 

AT4g01320 546 

AF007269 CTATTCTCCATTGAGTGTGAGCTTCAAAGTTTTTAGTTTTCGTGTTAAAAATTTAAAATT 1680 
Consensus XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1680 



1690 1700 1710 1720 1730 1740 

| | | | | j j I I I I I 

afcl ^^^^^^^^^^^^^^ 549 

AT4g01320 BI'^^^^^^S^^^^M 570 

AF007269 TGCTTCTCTGAGCATGAAGTTTCTATCTTTTTCCAG^ ^^^ ^^^^^^^^g^^^ 174 0 
Consensus XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX^^^^^^^^^^^^^^^^ 1740 



afcl 

AT4g01320 

AF007269 

Consensus 



1750 
I I 1 



1760 
■ ■ I ■ . . 



I 



1770 
. . I . . ■ 



1780 
■ ■ I . ■ . 



1790 



I 



TATCTGTGGGCATTCATGTTTATCCTGTCTCTAGTGATGATGACTATATACCCGGTCTTG 
TATCTGTGGGCATTCATGTTTATCCTGTCTCTAGTGATGATGACTATATACCCGGTCTTG 
TATCTGTGGGCATTCATGTTTATCCTGTCTCTAGTGATGATGACTATATACCCGGTCTTG 
TATCTGTGGGCATTCATGTTTATCCTGTCTCTAGTGATGATGACTATATACCCGGTCTTG 



1800 

609 
630 
1800 
1800 



1810 1820 1830 1840 1850 1860 

I I I 1 I I I I I I I I 

afcl ^^^^^^^^^^^^^^^^^M 639 

AT4g01320 I^^^^^^^^^^^^^^^^^^M 660 

AF007269 ^^^^^^^^^^^^^^^^^ ^^^B GTGTGTATTTCTGTCATGGCCATTTTACAA 1860 
Consensus ISI^S^SIS^SSS^BS^^^HxXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 18 60 



1870 1880 1890 1900 1910 1920 

I | | I I I I \ I I I I 

afcl 639 

AT4g01320 660 

AF007269 TTCACTGCTTGTTTGCATATGTTGTTACCAGACAATATAATCTCCCGCTTTTTTATGGCT 1920 
Consensus XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1920 



afcl 

AT4g01320 

AF007269 

Consensus 



I 



1930 
. . I . . . 



1940 1950 1960 
. . I I I I I . . . 



1970 
• • I . . . 



1980 



ATAG 
XXXX 



I 



CTTCCAGATGGAGACCTCCGGGAGAAGATTGAGAAACTTGCTTCTTCTCTAAAGTT 
CTTCCAGATGGAGACCTCCGGGAGAAGATTGAGAAACTTGCTTCTTCTCTAAAGTT 
CTTCCAGATGGAGACCTCCGGGAGAAGATTGAGAAACTTGCTTCTTCTCTAAAGTT 



695 
716 
1980 
1980 



1990 2000 2010 2020 2030 2040 

164 
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afcl 

AT4g01320 

AF007269 

Consensus 



I 



I 



I , 



I 



I 



I 



I 



I 



I 



TCCTTTGAAGAAGCTGTTTGTTGTCGATGGATCTACAAGGTCAAGCCATAGCAATG 
TCCTTTGAAGAAGCTGTTTGTTGTCGATGGATCTACAAGGTCAAGCCATAGCAATG 
TCCTTTGAAGAAGCTGTTTGTTGTCGATGGATCTACAAGGTCAAGCCATAGCAATG 
TCCTTTGAAGAAGCTGTTTGTTGTCGATGGATCTACAAGGTCAAGCCATAGCAATG8 



751 

772 

rGAG 2040 
CXXX 2040 



2050 2060 2070 2080 2090 2100 

| | | | | | | | I I | | 

afcl 751 

AT4g01320 772 

AF007269 AAGCTTGAGATCTCTTCCTACCTACTTTACTCTAGTTTACCATTAGAAGCTTACGTATCT 2100 
Consensus XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 2100 



2110 2120 2130 2140 2150 2160 
I | | I I I I .... I I I 1 I 

afci HSS^S^SS^^S^^^^^^^^^^^^^^^3B 795 

AT4g01320 ^^^^^^^^^^^^^^^^^M^^^^^^^^Sl 816 

AF0072 69 TGTTACATCATACAGG Bi[^ 2160 
Consensus XXXXXXXXXXXXXXXxB^^^^^^^^^^^^^^^tefi^^^^^^^^^M 2160 



2170 2180 2190 2200 2210 2220 

I 1 I i I I I I I I I I 

afcl ^^^^^^^^^^^ 813 

AT4g01320 ^^^^^^^^^^^^ 834 

AF007269 ^^^^^^ ^S^P^^^ GTACTGTGACTCTTGATGCTTCAAACGAGCTATACTCACATT 2220 
Consensus ^^^^^^^^^^MHxXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 2220 



2230 2240 2250 2260 2270 2280 

I | I | I I I I I t I I 

afcl B^^^^^^^^^g 829 

AT4g01320 BP^F^-^^^^M 850 

AF007269 TCTGTTTCTGGTTCTGAAACATAACATAATCTTCTATTGTGCAG^^^^^^^^^^B 2280 
Consensus xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxBB3SIS3^SKH93SH 22 80 



afcl 

AT4g01320 

AF007269 

Consensus 



I 



2290 
. . I . . . 



I 



2300 
. . I . . . 



2310 



I 



2320 
• ■ 1 - - ■ 



I 



2330 2340 
. . I h 



! 



AAATTGTGGCGGTTATTGCACACGAGCTTGGACATTGGAAACTGAATCACACTACATACT 
AAATTGTGGCGGTTATTGCACACGAGCTTGGACATTGGAAACTGAATCACACTACATACT 
AAATTGTGGCGGTTATTGCACACGAGCTTGGACATTGGAAACTGAATCACACTACATACT 
AAATTGTGGCGGTTATTGCACACGAGCTTGGACATTGGAAACTGAATCACACTACATACT 



2350 2360 2370 2380 2390 2400 

I I I . . . .1 I I I I I I I I 

afcl f^^^^^^^^^S 906 

AT4g01320 927 

AF007269 ^B^^^^^^ ^S gTGAGGCTCAACCGACAGTTCAAAAACTTACTCACATCTACAT 2400 
Consensus ^f^^S^^^^^^lixXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 24 00 



2410 2420 2430 2440 2450 2460 

I | | | | | | | I I I I 

afcl ^^^^^ 915 

AT4g01320 936 

AF007269 TTCACTTAAGAAATCATGTCTTATGACCCTCTCTCAATGTTTTGCTTGCAG^^^^^« 24 60 
Consensus XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX^^^^^S 24 60 



afcl 

AT4gO1320 

AF007269 

Consensus 



I 



2470 
. . i . . . 



I 



2480 
. . | . . . 



2490 



f • 



2500 
. . | . . . 



2510 



2520 



I 



I 



TTCTTACAATTTGGAGGATACACTCTTGTCAGAAACTCCACTGATCTCTTCAGGAGTTTC 
TTCTTACAATTTGGAGGATACACTCTTGTCAGAAACTCCACTGATCTCTTCAGGAGTTTC 
TTCTTACAATTTGGAGGATACACTCTTGTCAGAAACTCCACTGATCTCTTCAGGAGTTTC 
TTCTTACAATTTGGAGGATACACTCTTGTCAGAAACTCCACTGATCTCTTCAGGAGTTTC 



975 
996 
2520 
2520 



2530 2540 2550 2560 2570 2580 

I I I I I I I f I I I I 

afcl ^ ^ ^EMl^^^^^S^^^^^^S^M^^^^^^^^ 1020 

AT4g01320 ^^^^^^^^^^^^^^SH^^^^^^MMS^ 1041 

AF007269 ^^^^^^^^^^^^^^^^^^^^^^S^^^^S GTTTGTTATTTTTGC 2580 
Consensus ffi^^^^^^^^^^^^^^^^^^^^fe^^^^H^XXXXXXXXXXXXXX 2580 



2590 2600 2610 2620 2630 2640 

I I | I | | | .... I I I I I 

afcl 1020 

AT4g01320 1041 

AF007269 CTTTTGACACTAATCTAATGAATCAAGGATGGATTAAGAAAAAAAAACTCTAAACCTTTG 2 64 0 
Consensus XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 2640 



2650 



2660 



2670 



2680 



2690 



2700 



165 



WO 03/012116 
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I I I | I . . . . I I I i I I I 

afcl ^ ^^ S^^^^^^S^^^^^^^^^^S 1053 

AT4g01320 1074 

AF007269 GTTATATCTCCTGTCTGATTATCACAG^^^^^^^^S.^^^S^^^^^P^^B 2700 
Consensus XXXXXXXXXXXXXXXXXXXXXXXXXXxB^^^^^^^^^^^^^^^^^^^^^ 2700 



2710 2720 2730 2740 2750 2760 

I I I I | I I I I I I I 

afcl ^^^^^^^S^^SS^S^^^^^SBi 1093 

AT4g01320 ^^^^^^S^^^^^^^^^^^S^^^^^S 1114 

AF0O7269 ^^^^^^^^^^^^^^^^^^^^^^^^S TACCATCTTACAATCCCTCA 2760 
Consensus ^^^^^S^^^^^S^^^^^MK^^^^^BxXXXXXXXXXXXXXXXXXXX 27 60 



2770 2780 2790 2800 2810 2820 

I | | i | | | I I I | | 

afcl 1093 

AT4g01320 1114 

AF007269 AGATCCAACCATAGTTTCTTTATTGCAATGGCAGCCTCATCTACTAATCTGAGTTAACGT 2820 
Consensus XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 2820 



2830 2840 2850 2860 2870 2880 
I I . . . . I I I I I I [ I t j 

afci 1141 

AT4gO1320 ^^mg^^E^^^^^^^^^^^^^^^^^^l^M 1162 

Consensus XXXXXXXXXXXX^^^^^^SfflxHig^^SS^^^^^^^^^^^S^^S 2 8 80 



afcl 

AT4g01320 

AF007269 

Consensus 



I 



2890 
. . | . . . 



2900 



2910 
. . I . . . 



2920 
. . | . . . 



I 



2930 
. . | . . . 



2940 



I 



1159 
1222 
2940 

xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 2 940 



CTCTAGTGAAACTACAGG 

CTCTAGTGAAACTACAGGTCAGAGAAGATAACAACAGAACACAAACTGTTACCTCAATTT 
CTCTAGTGAAACTACAGGTCAGAGAAGATAACAACAGAACACAAACTGTTACCTCAATTT 



afcl 

AT4g01320 

AF007269 

Consensus 



2950 
. . j | 



2960 
■ ■ I • • • 



I 



2970 
. . | . . . 



I 



2980 2990 
. . I I I . . . 



AAGAGAACTTATCAGCAA 

GTGTCACACACTTAAATGGATTTTTTGTTGGGATTTTGCAGGAAGAGAACTTATCAGCAA 
GTGTCACACACTTAAATGGATTTTTTGTTGGG ATTTTGCAGG AAGAGAACTTATCAGCAA 

AAGAGAACTTATCAGCAA 



xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxg 



3000 

1177 
1282 
3000 
3000 



afcl 

AT4g01320 

AF007269 

Consensus 



I 



3010 
. . | . . . 



I 



3020 



I 



3030 
. . j . . . 



3040 



I 



I 



3050 
. . | . . . 



3060 
I I 



TGAACACTGATCCATTGgACTCAGCTTATCACTACTCACATCCTCCTCTTGTTGAAAGGC 
TGAACACTGATCCATTGTACTCAGCTTATCACTACTCACATCCTCCTCTTGTTGAAAGGC 
TGAACACTGATCCATTGTACTCAGCTTATCACTACTCACATCCTCCTCTTGTTGAAAGGC 
TGAACACTGATCCATTGSACTCAGCTTATCACTACTCACATCCTCCTCTTGTTGAAAGGC 



1237 
1342 
3060 
3060 



afcl 

AT4g01320 

AF007269 

Consensus 



3070 3080 
- • I ! I ■ . - 



3090 



I 



TTCGAGCCATTGATGGAGAAGACAAGAAGACAGATTAA 
TTCGAGCCATTGATGGAGAAGACAAGAAGACAGATTAA 
TTCGAGCCATTGATGGAGAAGACAAGAAGACAGATTAA 
TTCGAGCCATTGATGGAGAAGACAAGAAGACAGATTAA 



1275 
1380 
3098 
3098 



Table 22. ClustalW Analysis of Generic Amino Acids 



1) 
2) 
3) 
4) 



afcl 

AT4g01320 

AF007269 

Consensus 



(SEQ ID NO:30) 
(SEQ ID NO:32) 
(SEQ ID NO:34) 
(SEQ ID NO:71) 



10 20 30 40 50 60 
I I I I 1 I 1 t I I I I 

AT4g01320 SfiS^ 60 
AF007269 ^^^^^^^^^^^^^^^^^^^^^^^gjl 41 

Consensus Publi ^^^ffi^^^M^ ^^^^^^ ^^^^^ ^BBini xXXXXXXXXXXXXXXXXXXX 60 

70 80 90 100 110 120 
I | | | | | I I I I I .... I 



166 



WO 03/012116 
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AT4g01320 EN FN I C^^^^^^^^^^^^^^^m^^^^^ ^™33Hl^ ^^^^^^^^ 120 

AF007269 T 42 

Consensus Publi XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX^XXXXXXXXXXXXXXXXXXXXX 120 

130 140 150 160 170 180 

I I I I I I I i I > I I 

afcl SH^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^H 173 

AT4g01320 ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ 180 

AF007269 -^^^^^^^^^^^^^^J^^^SISSSISS^^^^^^^^^^i 93 

Consensus Publi XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 180 

190 200 210 220 230 240 

I 1 I I i I I I I I 1 I 

afcl ^^^^S^^^^^M^^^^^^^^^^^^^^^^^^^^^^^^^^M 233 

AT4g01320 ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ 240 

AF007269 W^^^^^^^^^^^ ^^^^^^^^^^^ ^^^^^^^SBM 153 

Consensus Publi XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 240 



afcl 

AT4g01320 
AF007269 
Consensus Publi 



I 



250 



260 



! 



! 



270 



280 



! 



I 



290 



300 



I 



I 



LKKLFVVDGSTRSSHSNAYMYGFFKNKRI VLYDTLIQQCKNEDEI VAVIAHELGHWKLNH 
LKKLFVVDGSTRSSHSNAYMYGFFKNKRI VLYDTLIQQCKNEDEIVAVI AHELGHWKLNH 
LKKLFVVDGSTRSSHSNAYMYGFFKNKRI VLYDTLIQQCKNEDEI VAVIAHELGHWKLNH 



XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 300 



310 320 330 340 350 360 
I I 1 I I 1 I I 1 I I I 

afcl ^^^^^^^^^^^S^^^^^^^^^^^^^^^^^^^^^^^^M 353 

AT4g01320 ^^^^^^^^^^ ^^^^^^^S^^^ ^M^^^^^^^^^^^^^^M 360 

AF007269 ^^^^Sr^" " - ^^^^S^^S^ 235 

Consensus Publi XXXXXXXXXXXXXXXXXXXXjgxj^XXXXXXXXXXXXXXXXXXX^XjtfflX^XXXX Wisix x 3 60 

370 380 390 400 410 420 
I I I I I I I I I I I I 

afci SS^^^^S^SI5^^S£^ 20^SEM 386 

AT4g01320 ^^^^^^^^^^^^^^^^^^^^M^^^^^^^ffl VTSICVTHLNGFFVGIL 420 
AP007269 ^^^^^^^^^^^^^^^^ ^^^^^^ ^^^W^ 278 

Consensus Publi XXXXXxHHxXXXXXXXXXXXXXXXBXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGXX 420 

430 440 450 460 470 480 
I I I I 1 I I I I I I I 

afcl 

AT4g01320 
AF007269 
Consensus Publi 



HEENLSAMNTDPLyySAYHYSHPPLVERLRAI DGEDKKTD 
HeENLSAMNTDPLYSAYHYSHPPLVERLRAI DGEDKKTD 
BeENLSAMNTDPLYSAYHYSHPPLVERLRAI DGEDKKTD 
B»S4SWlSAMNTDPLYSAYHYSHPPLVERLRAI DGEDKKTD 



Table 23. ClustalW Analysis of PPI Nucleic Acids 



1) 

2) 
3) 
4) 



PPI-AtCPP 
PPI-BnCPP 
PPI-SoyCPP 
Consensus 



(SEQ ID NO:l) 
(SEQ ID NO: 14) 
(SEQ ID NO: 17) 
(SEQ ID NO:72) 



PPI-AtCPP 
PPI-BnCPP 
PPI-SoyCPP 
Consensus 



PPI-AtCPP 
PPI-BnCPP 
PPI-SoyCPP 
Consensus 



PPI-AtCPP 
PPI-BnCPP 
PPI-SoyCPP 




130 
. . | . . 



140 



150 



160 



170 



180 
. • i 




WO 03/012116 
Consensus 



PPI-AtCPP 
PPI-BnCPP 
PPI-SoyCPP 
Consensus 



PPI-AtCPP 
PPI-BnCPP 
PPI-SoyCPP 
Consensus 



PPI-AtCPP 
PPI-BnCPP 
PPI-SoyCPP 
Consensus 



PPI-AtCPP 
PPI-BnCPP 
PPI-SoyCPP 
Consensus 



PPI-AtCPP 
PPI-BnCPP 
PPI-SoyCPP 
Consensus 



PPI-AtCPP 
PPI-BnCPP 
PPI-SoyCPP 
Consensus 



PPI-AtCPP 
PPI-BnCPP 
PPI-SoyCPP 
Consensus 



PPI-AtCPP 
PPI-BnCPP 
PPI-SoyCPP 
Consensus 



PPI-AtCPP 
PPI-BnCPP 
PPI-SoyCPP 
Consensus 



PPI-AtCPP 
PPI-BnCPP 
PPI-SoyCPP 
Consensus 



PPI-AtCPP 
PPI-BnCPP 
PPI-SoyCPP 



'TGAGAAATC 



PCT/IB02/03887 

180 



190 



I 



200 
. . I . . 



I 



210 
. - I . . 



I 



220 
. . | . . 



I 



230 
■ • I . . 



I ■ 



240 



_attttcactttgttcatgagtttgtgactatacttatggactctgcaattttgttcttt 
cattttcactttgttcatgagtttgtHactatacttatggactctgc^attQtgttcttt 

CAgTTgCAgTTTGTTCAgGAGTTTGTgACgATAgTgA^GACTCTgCAATTTTGTgCTTT 

1 ^gactctScSattStgtScttt 



a3ttmca2tttgttcamgagtttgt«ac3ata«tSA! 



240 
240 
240 
240 




370 
. • I ■ • 



380 



390 



400 
■ . I . . 



410 
I I I 



420 
• • 1 




WO 03/012116 
Consensus 



PPI-AtCPP 
PPI-BnCPP 
PPI-SoyCPP 
Consensus 



PPI-AtCPP 
PPI-BnCPP 
PPI-SoyCPP 
Consensus 



PPI-AtCPP 
PPI-BnCPP 
PPI-SoyCPP 
Consensus 



PPI-AtCPP 
PPI-BnCPP 
PPI-SoyCPP 
Consensus 



PPI-AtCPP 
PPI-BnCPP 
PPI-SoyCPP 
Consensus 



PPI-AtCPP 
PPI-BnCPP 
PPI-SoyCPP 
Consensus 



PPI-AtCPP 
PPI-BnCPP 
PPI-SoyCPP 
Consensus 



PPI-AtCPP 
PPI-BnCPP 
PPI-SoyCPP 
Consensus 



PCT/IB02/03887 



840 



850 



860 



870 



880 



890 



900 




1270 




Table 24. ClustalW Analysis of PPI Amino Acids 



1) 
2) 
3) 
4) 



PPI-AtCPP 
PPI-BnCPP 
PPI-SoyCPP 
Consensus 



PPI-AtCPP 
PPI-BnCPP 
PPI-SoyCPP 
Consensus PPI 



(SEQ ID NO:2) 
(SEQIDNO:15) 
(SEQ ID NO: 18) 
(SEQ ID NO:73) 



10 



I 



20 



30 



40 



I 



I 



50 



I 



60 
. I 



MAFPYMEAVVGFMILMYI FETYLDVRQH RALKLPTLPKTLEGVI SQEKFEKSRAYSLDKS 
MAgpgME0VVGFMI@MYgFETYLDgRQH[jALKLPTLPKTLgGVISQEKFEKSRAYSLDKS 
MAFPYMEAVVGFMILMYI FETYLDVRQH RALKLPTLPKTLEGVI SQEKFEKSRAYSLDKS 

maSpSmeSvvgfmiSmySfetyldSrqhSalklptlpktlSgvisqekfeksraysldks 



70 



80 



90 

. I . . 

169 



100 



110 



120 



WO 03/012116 



PCT/IB02/03887 



PPI-AtCPP 
PPI-BnCPP 
PPI-SoyCPP 
Consensus PPI 



HFH FVHEFVTI VTDST ILYFGVLPWFVJKKSGDF MTI A G FNA ENEI LHTLAFLAGLMIWSC 
HFHFVHEFVTlgJjDSglLgB^GjLPWFWP-lQsGgFg^EGj^gENEILHTLgFLAGLMQwSC 
HFHFVHEFVTI VTDST I LYFGVLPWFWKKSGDF MTI A G FHA ENEI LHTLAFLAGLMIWSC 
H FH F VH E F VT I S S I LS FGSL P W F W Y [5 S G>3 F3333G353 EMEILHTLSFLAG LMBw S Q 



120 
120 
120 
120 



PPI-AtCPP 
PPI-BnCPP 
PPI-SoyCPP 
Consensus PPI 



PPI-AtCPP 
PPI-BnCPP 
PPI-SoyCPP 
Consensus PPI 




I 



190 



200 



I 



210 



220 



I 



230 



I 



240 



LAI YLWVFTFGLS I VMMTLYPVLI APLFNKFTPLPDGQLREKI EKLASSLNYPLKKLFVV 
LAIYLWgFjjjFgLS@VMMT@YPVLIAPLFNKFTPLPDG|3LREKIEKLASSL|g|PLKKLFVV 
LAI YLWVFTFGLS I VMMTLYPVLI APLFNKFTPLPDGQLREKI EKLASSLNYPLKKLFVV 

laiylwBfBfSlsBvmmtHypvliaplfnkftplpdgSlrekieklassl^plkklfvv 



240 
240 
240 
240 



PPI-AtCPP 
PPI-BnCPP 
PPI-SoyCPP 
Consensus PPI 



250 



260 



270 



280 



290 



300 




310 



320 



330 



340 



350 



360 



PPI-AtCPP 
PPI-BnCPP 
PPI-SoyCPP 
Consensus PPI 



PPI-AtCPP 
PPI-BnCPP 
PPI-SoyCPP 
Consensus PPI 




430 440 450 460 470 480 

I I I I I I I I I I I I 

PPI-AtCPP g ggg 424 

PPI-BnCPP j!l5 T !3 424 

PPI-SoyCPP SjbS 424 

Consensus PPI ffl^H 480 



5 



Table XX. ClustalW Analysis of PPI/Generic Nucleic Acids 



1) PPI-AtCPP (SEQ ID NO: 1) 

2) ppi-Bncpp (SEQ ID NO: 14) 

3) ppi-soycpp (SEQ ID NO: 17) 

4) afci (SEQIDNO:29) 

5) AT4g0i32O (SEQ ID NO:3 1) 

6) AF007269 (SEQ ID NO:33) 
6) Consensus (SEQ ID NO:75) 



PPI-AtCPP NA 
PPI-BnCPP 
PPI-SoyCPP 
afcl 

AT4g01320 



10 20 30 40 50 60 70 
I | | | | j | | j | | I I | 



170 



WO 03/012116 



PCT/IB02/03887 



AF007269 
Consensus 



PPI-AtCPP NA 
PPI-BnCPP 
PPI-SoyCPP 
afcl 

AT4g01320 

AF007269 

Consensus 



PPI-AtCPP NA 
PPI-BnCPP 
PPI-SoyCPP 
afcl 

AT4g01320 

AF007269 

Consensus 



PPI-AtCPP NA 
PPI-BnCPP 
PPI-SoyCPP 
afcl 

AT4g01320 

AF007269 

Consensus 



PPI-AtCPP NA 
PPI-BnCPP 
PPI-SoyCPP 
afcl 

AT4g01320 

AF007269 

Consensus 



PPI-AtCPP NA 
PPI-BnCPP 
PPI-SoyCPP 
afcl 

AT4g01320 

AF007269 

Consensus 



PPI-AtCPP NA 
PPI-BnCPP 
PPI-SoyCPP 
afcl 

AT4g01320 

AF007269 

Consensus 



PPI-AtCPP NA 
PPI-BnCPP 
PPI-SoyCPP 
afcl 

AT4g01320 

AF007269 

Consensus 



PPI-AtCPP NA 
PPI-BnCPP 



ATGGCGATTCCTTTCATGGAAACCGTCGTGGGTAAGCTTCAAAACCTTTTTCTGAGACATTTTACTATCC 



80 90 100 110 120 130 140 

I | | | | | I I I I j 1 I 

-ATGGCGTTTCCCTACATGGAAGCCGTTGTCGGATTTATGATATTAATGTACATTTTTGAA 



•ATGGCGTTTCCCTACATGGAAGCCGTTGTCGGATTTATGATATTAATGTACATTTTTGAA 



TGTTTCACTCATCGTATTTCGTTTTTGTTTGGGTTTTGCTTTCTGTGTTGTGTGTGTTGAGATTCCATGA 
ATGGCGATTCCTTTCATGGAAACCGTCGT-GGTTTTATGATAT — ATGTACATTTTTGAA 

150 160 170 180 190 200 210 

I | | | | I I | | | | I | I 

ACTTACTTGGATG-TGCGACAACATAGGGCCCTCAAACTTCCTACTCTTCCAAAGACTTTAGAGGGTGTT 



ACTTACTTGGATG-TGCGACAACATAGGGCCCTCAAACTTCCTACTCTTCCAAAGACTTTAGAGGGTGTT 



-CTCGTTTGTTTCATATACCATCGTCTCTGCTTCTCGTTTCTAAATTTTGTTCTTTTCTAATAGTGCGTA 
— CTATTTGGAT TGGCAACATG CCTCAA — CTTCCACTCTCC AAACTTGGTGGTGTAT - 

220 230 240 250 260 270 280 

j I | | I | | | | | | | I I 

ATCAGCCAAGAGAAATTTGAGAAATCTAGAGCCTATAGTCTTGATAAAAGCCACTTCCATTTTGTTCACG 



ATCAGCCAAGAGAAATTTGAGAAATCTAGAGCCTATAGTCTTGATAAAAGCCACTTCCATTTTGTTCACG 



CCTTGATCTGAGGTTTTATTACTCCTACTAGTTTCTTGTCTTACTCGTG — CGTTT-GATTTGATTTGAG 
AGC C AAG AG AAGTTT G AG AAATC T GAG — CTACAGTCTTGAAAAAG- -CATT — CATTT-GTTCA-G 

290 300 310 320 330 340 350 

I | j I I I I | | I I I I I 

AGTTTGTGACAATAGTGACAGACTCTACAATTTTGTACTTTGGGGTATTGCCCTGGTTTTGGAA G 



AGTTTGTGACAATAGTGACAGACTCTACAATTTTGTACTTTGGGGTATTGCCCTGGTTTTGGAA G 



CTTATGTGA-TTTCATCATCTCTTCCTCGGTTTTAGAATGTACGGAGCTTCTCTGTTAACCAAAATCTAG 
AGTTTGTA — CATAGT--TAGACTCT-CAATTTTGT-CTTTGGG TTTGCCTGGTTTTGGAA G 

360 370 380 390 400 410 420 

| I | | | j | | | | | | | | 

AAATCAGGAGATTTTATGACAATAGCTGGTTTCAATGCTGAGAATGAAATACTGCATACCCTTGCCTTCT 



AAATCAGGAGATTTTATGACAATAGCTGGTTTCAATGCTGAGAATGAAATACTGCATACCCTTGCCTTCT 



GATTTGGGAAGAAAAGTCGGAGTCTTTTTTTTCCTCATTCCCGATTGGAAATTGAGAATCTTGAAATTTT 
AT-TCGGG TTTTGCAA TTGGT CATCGAGAATGAAAT-CTGCATACC-TT — CTTCT 

430 440 450 460 470 480 490 

I | | | | | | | | I I I | | 

TAGCAGGGCTGATGATTTGGTCACAGATAACAGATTTGCCCTTTTCTCTGTACTCAACTTTT GTG 



TAGCAGGGCTGATGATTTGGTCACAGATAACAGATTTGCCCTTTTCTCTGTACTCAACTTTT GTG 



TCTTTGTTCAAGTCATACAGCTTGAGGTTTTGGGTTTTCTTGTCAGGGTATTATTATGTTCGTGACTGCA 
T-GCGGT AT GAT — GGTCACAGATA — CGATTTGCCTTTTCTT — GTACTCAACTTT GTG 

500 510 520 530 540 550 560 

| | | | | I | | | | j | | | 

ATTGAGGCCCGTCATGGTTTTAATAAGCAAACAC — CATGGTTATTCTTTAGGGACATGCTTAAAGGAAT 



ATTGAGGCCCGTCATGGTTTTAATAAGCAAACAC--CATGGTTATTCTTTAGGGACATGCTTAAAGGAAT 



ACTAGAGTTTTCTGGAGTTTTTTGAAATGGGTTTTGTGTTGTGGAACCGTATGTGAATGTTGCATCAAAA 
AT — GAGTCCG-CATGGTTAAAACAAACA CATGGTT TCTTAGGGACATG— TAAAGGAAT 

570 580 590 600 610 620 630 

I | | | | j ) I I I I I | I 

TTTCCTTTCTGTAATAATTGGTCCACCTATTGTGGCTGCAA TCATTGTAA-TAGTACAGAAAGG 



171 



WO 03/012116 



PCT/IB02/03887 



PPI-SoyCPP 
afcl 

AT4g01320 

AF007269 

Consensus 



TTTCCTTTCTGTAATAATTGGTCCACCTATTGTGGCTGCAA- 



- TC AT TGTAA - T AGT AC AGAAAGG 



CTCTTTCAGTGCTCCAATGTTTCCATCAGTAGTCAGCACAAGAGATCTTTTTATATCTGGTTGATCAAAA 
TTCCTTCTGTATA G — CC-CCTATTGTG-CTGCAA T-ATTGTA — TAGT-C AGAAAGG 



PPI-AtCPP NA 
PPI-BnCPP 
PPI-SoyCPP 
afcl 

AT4g01320 

AF007269 

Consensus 



640 650 660 670 680 690 700 

| | | | | | | | I I | | | 1 

AGGTCCATA — CTTGGCCATCTATCTTTGGGTTTTTACGTTTGGTCTTTCTATTGTGATGATGACCCTT 

AGGTCCATA — CTTGGCCATCTATCTTTGGGTTTTTACGTTTGGTCTTTCTATTGTGATGATGACCCTTE 



AAGTAGATGATGTTATTGAATTTTCAGTGATGGAGTATCTGTTGTTGTGGCATTTAGAGTAGATTCGTS 
AGGTCC TATG-CCATCTATCTTGGG TTTAGTTTTCTTCTTGTGATGATGACC — T| 



PPI-AtCPP NA 
PPI-BnCPP 
PPI-SoyCPP 
afcl 

AT4g01320 

AF007269 

Consensus 



PPI-AtCPP NA 
PPI-BnCPP 
PPI-SoyCPP 
afcl 

AT4g01320 

AF007269 

Consensus 



PPI-AtCPP NA 
PPI-BnCPP 
PPI-SoyCPP 
afcl 

AT4g01320 

AF007269 

Consensus 



PPI-AtCPP NA 
PPI-BnCPP 
PPI-SoyCPP 
afcl 

AT4g01320 

AF007269 

Consensus 



PPI-AtCPP NA 
PPI-BnCPP 
PPI-SoyCPP 
afcl 

AT4g01320 

AF007269 

Consensus 



PPI-AtCPP NA 
PPI-BnCPP 
PPI-SoyCPP 
afcl 

AT4g01320 

AF007269 

Consensus 



770 
I 

.GGGAGAA 
IGTATTTGG 
.GGGAGAA 
IGTATTTGG 
GTATTTGG 
IGTATTTGG 
GGGAGAA 




990 

I I 

GIGGACACr 
3ATGAGT 
3GGACAC 
CATGAGT 
3 AT G AGT 
CATGAGT 

gSgggacacj 

1060 



1070 



ttgga 


£ 


G 


ttgga 


5 


G 


ttgga 




G 


TTGGA 


3 


G 


TTGGA 


i 


G 


TTGGA 


1 


G 


TTGGA 


3 


G 



1080 1090 1100 1110 1120 

... 1 I | | I I I | I 

kTATACACTAGTGCGAAATTCAGCTGATCTGTATCGAAGCTTTGGGTTTGATACGCAGCCAGT 

lTATACACTAGTGCGAAATTCAGCTGATCTGTATCGAAGCTTTGGGTTTGATACGCAGCCAGT 



iTACATATCTGGTTTCGGTATACAGTATCT-CATTTTGAATATAGAGTTGTTACATTACAA — 
iTACAC-CTAGTG — AAATCC TGATCT TGAG TTGGTTTGATAC-CAGCCG — 
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Table XX. ClustalW Analysis of PPI/Generic Nucleic Acids 

1) PPI-AtCPP (SEQ ID NO: 1) 

2) PPI-BnCPP (SEQ ID NO: 14) 

3) PPI-SoyCPP (SEQIDNO:17) 

4) afcl (SEQ ID NO:29) 

5) AT4 g 0l320 (SEQIDNO:31) 

6) AF007269 (SEQ ID NO:33) 
6) Consensus (SEQ ID NO:75) 




tcctcttgtwgagaggcttcgagccattgatggagaagacaagaagacagattaa 
tcctcttgtBgaBaggcttcgagccattgatggagaagacaagaagacagattaa 
tcctcttgtBgaHaggcttcgagccattgatggagaagacaagaagacagattaa 
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Table 26. ClustalW Analysis of PPI/Generic Amino Acids 
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What is claimed is: 



1 . A method of producing a transgenic plant, comprising introducing into a plant 
cell a compound that increases prenyl protease expression or activity to generate 
a transgenic cell; and regenerating a transgenic plant from said transgenic cell. 

2. The method of claim 1 , wherein said plant has an altered phenotype selected from 
the group consisting of increased tolerance to stress, delayed senescence, 
increased ABA sensitivity, increased yield, increased productivity and increased 
biomass compared to a wild type plant. 

3. The method of claim 1, wherein said compound comprises a nucleic acid 
sequence encoding prenyl protease. 

4. The method of claim 3, wherein said nucleic acid comprises SEQ ID NO: 1,14, 
17, 68, 70, 72, 74, 21, 23, 25, 27, 29, 31, or 33. 

5. The method of claim 3, wherein said nucleic acid is operably linked to a 
promotor. 

6. The method of claim 5, wherein said promoter is selected from the group 
consisting of a constitutive promoter, an ABA inducible promoter, tissue specific 
promoters or a guard cell-specific promoter 

7. The method of claim 1, wherein said compound is a prenyl protease polypeptide 
or fragment thereof. 

8. The method of claim 7, wherein said prenyl protease polypeptide comprises the 
amino acid sequence of SEQ ID NO: 2, 15, 18, 22, 24, 26, 28, 30, 32, 34, 69, 71, 
73, or 75. 

9. The transgenic plant produced by claim 1 . 
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10. The seed produced by the transgenic plant of claim 9, wherein said seed produces 
a plant that has an altered phenotype selected from the group consisting of 
increased tolerance to stress, delayed senescence, increased ABA sensitivity, 
increased yield, increased productivity and increased biomass compared to a 
wild type plant. 

11. A method of producing a transgenic plant, comprising introducing into a plant 
cell a nucleic acid that inhibits prenyl protease expression or activity to generate a 
transgenic cell; and regenerating a transgenic plant from said transgenic cell. 

12. The method of claim 1 1 , wherein said plant has an altered phenotype selected 
from the group consisting of increased tolerance to stress, delayed senescence, 
increased ABA sensitivity, increased yield, increased productivity and increased 
biomass compared to a wild type plant. 

13. The method of claim 11, wherein said nucleic acid comprises an antisense 
nucleic acid sequence encoding prenyl protease. 

14. The method of claim 13, wherein said antisense nucleic acid comprises 20 or 
more consecutive nucleic acids complementary to SEQ ID NO: 1, 14, 17, 21, 23, 
25, 27, 29, 31, 33, 68, 70, 72, or 74. 

15. The method of claim 13, wherein said antisense nucleic acid comprises SEQ ID 
NO: 16, 19, 20, 5, 35, 37, 38, 42, 43, 45 , 46, 48, 49, 51, or 52. 

1 6 The method of claim 1 1 , wherein said nucleic acid is operably linked to a 
promotor. 

17. The method of claim 16, wherein said promoter is selected from the group 
consisting of a constitutive promoter, an ABA inducible promoter, tissue specific 
promoters or a guard cell-specific promoter 

1 8. The method of claim 1 1 , wherein the nucleic acid is an inhibitor of farnesylation. 
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19 The transgenic plant produced by any one of the methods of claims 1 1 . 



20. The seed produced by the transgenic plant of claim 19, wherein said seed 
produces a plant that has an altered phenotype selected from the group consisting 
of increased tolerance to stress, delayed senescence, increased ABA sensitivity, 
increased yield, increased productivity and increased biomass compared to a 
wild type plant. 

21 . A method of producing a transgenic plant, comprising introducing into a plant 
cell a nucleic acid selected from the group consisting of SEQ ID NO: 16, 19, 20, 
5, 35, 37, 38, 42, 43, 45 , 46, 48, 49, 51, and 52 to generate a transgenic cell; and 
regenerating a transgenic plant from said transgenic cell. 

22. An isolated polypeptide comprising the mature form of an amino acid sequenced 
selected from the group consisting of SEQ ID NO: 2, 15, 18, 69, 71, 73 and 75. 

23 An isolated polypeptide comprising an amino acid sequence selected from the 
group consisting of SEQ ID NO: 2, 15, 18, 69, 71, 73 and 75. 

24. An isolated polypeptide comprising an amino acid sequence which is at least 
96% identical to an amino acid sequence selected from the group consisting of 
SEQ ID NO: 15 

25. An isolated polypeptide comprising an amino acid sequence which is at least 
99% identical to an amino acid sequence selected from the group consisting of 
SEQ ID NO: 2, 15, 18 

26 The polypeptide of claim 24, wherein said polypeptide has prenyl pretease 
activity. 

27 An isolated polypeptide, wherein the polypeptide comprises an amino acid 
sequence comprising one or more conservative substitutions in the amino acid 
sequence selected from the group consisting of SEQ ID NO: 2, 15, and 18. 
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28. The polypeptide of claim 23, wherein said polypeptide is naturally occurring. 



29. An isolated nucleic acid molecule comprising a nucleic acid sequence selected 
from the group consisting of SEQ ID NO: 1, 14, 17, 20, 16, 19, 68, 70, 72, and 
74. 

30. The nucleic acid molecule of claim 29, wherein the nucleic acid molecule is 
naturally occurring. 

31. An isolated nucleic acid molecule encoding the mature form of a polypeptide 
having an amino acid sequence selected from the group consisting of SEQ ID 
NO: SEQ ID NO: 2, 15, and 18. 

32. An isolated nucleic acid molecule, wherein said nucleic acid molecule hybridizes 
under stringent conditions to the nucleotide sequence selected from the group 
consisting of SEQ ID NO: 1, 14, 17, 20, 16, 19, 68, 70, 72, and 74. 

33. An isolated nucleic acid molecule comprising a nucleotide sequence which is at 
least 94% identical to the nucleotide sequence selected from the group consisting 
of SEQ ID NO: 17, 18 and 19. 

34. An isolated nucleic acid molecule comprising a nucleotide sequence which is at 
least 99% identical to the nucleotide sequence selected from the group consisting 
of SEQ ID NO: 1, 14, 17, 20, 16, 19. 

35. A vector comprising the nucleic acid molecule of claim 29. 

36. The vector of claim 35, further comprising a promoter operably linked to said 
nucleic acid molecule. 

37. A cell comprising the vector of claim 36. 
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38. An antibody that immunospecifically binds to the polypeptide of claim 22. 



39. The antibody of claim 38, wherein the antibody is a monoclonal antibody. 

40. The antibody of claim 37, wherein the antibody is a polyclonal antibody. 

41 . A method of identifying an agent that binds to the polypeptide of claim 27, the 
method comprising: 

(a) introducing said polypeptide to said agent; and 

(b) determining whether said agent binds to said polypeptide. 

42. The method of claim 41, wherein the agent is a farnesylation inhibitor. 

43. A method for identifying farnesylation modulator, the method comprising: 

(a) providing a cell expressing the polypeptide of claim 22; 

(b) contacting the cell with a candidate substance; and 

(c) determining whether the substance alters farnesylation activity; 

whereby, if an alteration observed in the presence of the substance is not 
observed when the cell is contacted with a composition in the absence of the 
substance, the substance is identified as a farnesylation modulator. 

44. A method for identifying an interacting gene of prenyl protease, the method 
comprising: 

a) providing the transgenic plant of claim 1 ; 

b) creating a library of mutagenized plants from (a); 

c) determining whether the mutagenized plant contains an altered 
phenotype; 
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whereby, the mutagenized plant has altered the function of an interacting gene of 
prenyl protease which results in an altered phenotype from the transgenic plant of 
(a) to that of a wild type non-transgenic plant. 



45. A plant, wherein a mutation has been introduced in the gene encoding prenyl 
protease, resulting in said plant displaying altered prenyl protease activity and an 
increased tolerance to stress as compared to a wild type plant. 
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